Modified nucleotides

ABSTRACT

The invention provides a modified nucleotide or nucleoside molecule comprising a purine or pyrimidine base and a ribose or deoxyribose sugar moiety having a removable 3′-OH blocking group covalently attached thereto, such that the 3′ carbon atom has attached a group of the structure —O-Z wherein Z is any of —C(R′)2-O—R″, —C(R′)2-N(R″)2, —C(R′)2-N(H)R″, —C(R′)2-S—R″ and —C(R′)2-F, wherein each R″ is or is part of a removable protecting group; each R′ is independently a hydrogen atom, an alkyl, substituted alkyl, arylalkyl, alkenyl, alkynyl, aryl, heteroaryl, heterocyclic, acyl, cyano, alkoxy, aryloxy, heteroaryloxy or amido group, or a detectable label attached through a linking group; or (R′)2 represents an alkylidene group of formula ═C(R′″)2 wherein each R′″ may be the same or different and is selected from the group comprising hydrogen and halogen atoms and alkyl groups; and wherein said molecule may be reacted to yield an intermediate in which each R″ is exchanged for H or, where Z is —C(R′)2-F, the F is exchanged for OH, SH or NH2, preferably OH, which intermediate dissociates under aqueous conditions to afford a molecule with a free 3′OH; with the proviso that where Z is —C(R′)2-S—R″, both R′ groups are not H.

The invention relates to modified nucleotides. In particular, this invention discloses nucleotides having a removable protecting group, their use in polynucleotide sequencing methods and a method for chemical deprotection of the protecting group.

Advances in the study of molecules have been led, in part, by improvement in technologies used to characterise the molecules or their biological reactions. In particular, the study of the nucleic acids DNA and RNA has benefited from developing technologies used for sequence analysis and the study of hybridisation events.

An example of the technologies that have improved the study of nucleic acids is the development of fabricated arrays of immobilised nucleic acids. These arrays consist typically of a high-density matrix of polynucleotides immobilised onto a solid support material. See, e.g., Fodor et al., Trends Biotech. 12:19-26, 1994, which describes ways of assembling the nucleic acids using a chemically sensitized glass surface protected by a mask, but exposed at defined areas to allow attachment of suitably modified nucleotide phosphoramidites. Fabricated arrays can also be manufactured by the technique of “spotting” known polynucleotides onto a solid support at predetermined positions (e.g., Stimpson et al., Proc. Natl. Acad. Sci. USA 92:6379-6383, 1995).

Sequencing by synthesis of DNA ideally requires the controlled (i.e. one at a time) incorporation of the correct complementary nucleotide opposite the oligonucleotide being sequenced. This allows for accurate sequencing by adding nucleotides in multiple cycles as each nucleotide residue is sequenced one at a time, thus preventing an uncontrolled series of incorporations occurring. The incorporated nucleotide is read using an appropriate label attached thereto before removal of the label moiety and the subsequent next round of sequencing. In order to ensure only a single incorporation occurs, a structural modification (“blocking group”) of the sequencing nucleotides is required to ensure a single nucleotide incorporation but which then prevents any further nucleotide incorporation into the polynucleotide chain. The blocking group must then be removable, under reaction conditions which do not interfere with the integrity of the DNA being sequenced. The sequencing cycle can then continue with the incorporation of the next blocked, labelled nucleotide. In order to be of practical use, the entire process should consist of high yielding, highly specific chemical and enzymatic steps to facilitate multiple cycles of sequencing.

To be useful in DNA sequencing, nucleotides, and more usually nucleotide triphosphates, generally require a 3′OH-blocking group so as to prevent the polymerase used to incorporate them into a polynucleotide chain from continuing to replicate once the base on the nucleotide is added. There are many limitations on the suitability of a molecule as a blocking group. It must be such that it prevents additional nucleotide molecules from being added to the polynucleotide chain whilst simultaneously being easily removable from the sugar moiety without causing damage to the polynucleotide chain. Furthermore, the modified nucleotide must be tolerated by the polymerase or other appropriate enzyme used to incorporate it into the polynucleotide chain. The ideal blocking group will therefore exhibit long term stability, be efficiently incorporated by the polymerase enzyme, cause total blocking of secondary or further incorporation and have the ability to be removed under mild conditions that do not cause damage to the polynucleotide structure, preferably under aqueous conditions. These stringent requirements are formidable obstacles to the design and synthesis of the requisite modified nucleotides.

Reversible blocking groups for this purpose have been described previously but none of them generally meet the above criteria for polynucleotide, e.g. DNA-compatible, chemistry.

Metzker et al. (Nucleic Acids Research, 22(20): 4259-4267, 1994) discloses the synthesis of eight 3′-modified 2′-deoxyribonucleoside 5′-triphosphates (3′-modified dNTPs) and their testing in two DNA template assays for incorporation activity. The 3′-modified dNTPs included 3′-allyl deoxyriboadenosine 5′-triphosphate (3′-allyl dATP). However, the 3′-allyl blocked compound was not used to demonstrate a complete cycle of termination, deprotection and reinitiation of DNA synthesis: the only test results presented were those which showed the ability of this compound to terminate DNA synthesis in a single termination assay, out of eight such assays conducted, each conducted with a different DNA polymerase.

WO02/29003 (The Trustees of Columbia University in the City of New York) describes a sequencing method which may include the use of an allyl protecting group to cap the 3′-OH group on a growing strand of DNA in a polymerase reaction. The allyl group is introduced according to the procedure of Metzker (supra) and is said to be removed by using methodology reported by Kamal et al. (Tet. Lett., 40, 371-372, 1999).

The Kamal deprotection methodology employs sodium iodide and chlorotrimethylsilane so as to generate in situ iodotrimethylsilane, in acetonitrile solvent, quenching with sodium thiosulfate. After extraction into ethyl acetate and drying (sodium sulfate), then concentration under reduced pressure and column chromatography (ethyl acetate:hexane; 2:3 as eluant), free alcohols were obtained in 90-98% yield.

In WO02/29003, the Kamal allyl deprotection is suggested as being directly applicable in DNA sequencing without modification, the Kamal conditions being mild and specific.

While Metzker reports on the preparation of a 3′-allyl-blocked nucleotide or nucleoside and WO02/29003 suggests the use of the allyl functionality as a 3′-OH cap during sequencing, neither of these documents actually teaches the deprotection of a 3′-allylated hydroxyl group in the context of a sequencing protocol. Whilst the use of an allyl group as a hydroxyl protecting group is well known (it is easy to introduce and is stable across the whole pH range and to elevated temperatures), there is, to date, no concrete embodiment of the successful cleavage of a 3′-allyl group under DNA compatible conditions, i.e. conditions under which the integrity of the DNA is not wholly or partially destroyed. In other words, it has not been possible hitherto to conduct DNA sequencing using 3′OH allyl-blocked nucleotides.

The Kamal methodology is inappropriate to conduct in aqueous media since the TMS chloride will hydrolyse preventing the in situ generation of TMS iodide. Attempts to carry out the Kamal deprotection (in acetonitrile) in sequencing have proven unsuccessful in our hands.

The present invention is based on the surprising development of a number of reversible blocking groups and methods of deprotecting them under DNA compatible conditions. Some of these blocking groups are novel per se; others have been disclosed in the prior art but, as noted above, it has not proved possible to utilise these blocking groups in DNA sequencing.

One feature of the invention derives from the development of a completely new method of allyl deprotection. Our procedure is of broad applicability to the deprotection of virtually all allyl-protected hydroxyl functionality and may be effected in aqueous solution, in contrast to the methodology of Kamal et al. (which is effected in acetonitrile) and to the other methods known generally in the prior art, which are highly oxygen- and moisture-sensitive. A further feature of the invention derives from the development of a new class of protecting groups. These are based upon acetals and related protecting groups but do not suffer from some of the disadvantages of acetal deprotection known in the prior art.

The allyl deprotection methodology makes use of a water-soluble transition metal catalyst formed from a transition metal and at least partially water-soluble ligands. In aqueous solution these form at least partially water-soluble transition metal complexes. By aqueous solution herein is meant a liquid comprising at least 20 vol %, preferably at least 50 vol %, for example at least 75 vol %, particularly at least 95 vol % and especially greater than 98 vol %, ideally 100 vol % of water as the continuous phase.
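The graded definition of "aqueous solution" given above can be restated as a small lookup. The following is only an illustrative sketch: the function name `aqueous_tier` and the wording of the returned labels are assumptions introduced here, and only the vol % thresholds are taken from the definition.

```python
# Thresholds are the vol % water figures quoted in the definition above.
# The function name and the returned labels are hypothetical, for illustration only.
def aqueous_tier(water_vol_percent):
    """Classify a liquid by its water content (vol %, water as continuous phase)."""
    if not 0 <= water_vol_percent <= 100:
        raise ValueError("vol % must lie between 0 and 100")
    if water_vol_percent < 20:
        return "not an aqueous solution as defined"
    if water_vol_percent < 50:
        return "aqueous (at least 20 vol %)"
    if water_vol_percent < 75:
        return "preferred (at least 50 vol %)"
    if water_vol_percent < 95:
        return "at least 75 vol %"
    if water_vol_percent <= 98:
        return "particularly preferred (at least 95 vol %)"
    if water_vol_percent < 100:
        return "especially preferred (greater than 98 vol %)"
    return "ideal (100 vol %)"
```

For example, a liquid containing 60 vol % water falls in the "preferred" band, whereas one containing 10 vol % water is outside the definition altogether.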

As those skilled in the art will appreciate, the allyl group may be used to protect not only the hydroxyl group but also thiol and amine functionalities. Moreover allylic esters may be formed from the reaction between carboxylic acids and allyl halides, for example. Primary or secondary amides may also be protected using methods known in the art. The novel deprotection methodology described herein may be used in the deprotection of all these allylated compounds, e.g. allyl esters and mono- or bisallylated primary amines or allylated amides, or in the deprotection of allylated secondary amines. The method is also suitable in the deprotection of allyl esters and thioethers.

Protecting groups which comprise the acetal functionality have been used previously as blocking groups. However, removal of such groups and ethers requires strongly acidic deprotections detrimental to DNA molecules. The hydrolysis of an acetal however, results in the formation of an unstable hemiacetal intermediate which hydrolyses under aqueous conditions to the natural hydroxyl group. The inventors have utilised this concept and applied it further such that this feature of the invention resides in utilising blocking groups that include protecting groups to protect intermediate molecules that would normally hydrolyse under aqueous conditions. These protecting groups comprise a second functional group that stabilises the structure of the intermediate but which can be removed at a later stage following incorporation into the polynucleotide. Protecting groups have been used in organic synthesis reactions to temporarily mask the characteristic chemistry of a functional group because it interferes with another reaction.

Therefore, according to a first aspect of the invention there is provided a modified nucleotide or nucleoside molecule comprising a purine or pyrimidine base and a ribose or deoxyribose sugar moiety having a removable 3′-OH blocking group covalently attached thereto, such that the 3′ carbon atom has attached a group of the structure —O-Z

wherein Z is any of —C(R′)₂—O—R″, —C(R′)₂—N(R″)₂, —C(R′)₂—N(H)R″, —C(R′)₂—S—R″ and —C(R′)₂—F,

wherein each R″ is or is part of a removable protecting group;

each R′ is independently a hydrogen atom, an alkyl, substituted alkyl, arylalkyl, alkenyl, alkynyl, aryl, heteroaryl, heterocyclic, acyl, cyano, alkoxy, aryloxy, heteroaryloxy or amido group, or a detectable label attached through a linking group; or (R′)₂ represents an alkylidene group of formula ═C(R′″)₂ wherein each R′″ may be the same or different and is selected from the group comprising hydrogen and halogen atoms and alkyl groups; and

wherein said molecule may be reacted to yield an intermediate in which each R″ is exchanged for H or, where Z is —C(R′)₂—F, the F is exchanged for OH, SH or NH₂, preferably OH, which intermediate dissociates under aqueous conditions to afford a molecule with a free 3′OH;

with the proviso that where Z is —C(R′)₂—S—R″, both R′ groups are not H.

Viewed from another aspect, the invention provides a 3′-O-allyl nucleotide or nucleoside which nucleotide or nucleoside comprises a detectable label linked to the base of the nucleoside or nucleotide, preferably by a cleavable linker.

In a further aspect, the invention provides a polynucleotide comprising a 3′-O-allyl nucleotide or nucleoside which nucleotide or nucleoside comprises a detectable label linked to the base of the nucleoside or nucleotide, preferably by a cleavable linker.

Viewed from a still further aspect, the invention provides a method of converting a compound of formula R—O-allyl, R₂N(allyl), RNH(allyl), RN(allyl)₂ or R—S-allyl to a corresponding compound in which the allyl group is removed and replaced by hydrogen, said method comprising the steps of reacting a compound of formula R—O-allyl, R₂N(allyl), RNH(allyl), RN(allyl)₂ or R—S-allyl in aqueous solution with a transition metal complex comprising a transition metal and one or more ligands selected from the group comprising water-soluble phosphine and water-soluble nitrogen-containing phosphine ligands, wherein the or each R is a water-soluble biological molecule.

In a further aspect the invention provides a method of controlling the incorporation of a nucleotide molecule complementary to the nucleotide in a target single-stranded polynucleotide in a synthesis or sequencing reaction comprising incorporating into the growing complementary polynucleotide a molecule according to the invention, the incorporation of said molecule preventing or blocking introduction of subsequent nucleoside or nucleotide molecules into said growing complementary polynucleotide.

In a further aspect, the invention provides a method for determining the sequence of a target single-stranded polynucleotide, comprising monitoring the sequential incorporation of complementary nucleotides, wherein at least one incorporation, and preferably all of the incorporations is of a nucleotide according to the invention as hereinbefore described which preferably comprises a detectable label linked to the base of the nucleoside or nucleotide by a cleavable linker and wherein the identity of the nucleotide incorporated is determined by detecting the label, said blocking group and said label being removed prior to introduction of the next complementary nucleotide.

From a further aspect, the invention provides a method for determining the sequence of a target single-stranded polynucleotide, comprising:

(a) providing a plurality of different nucleotides according to the hereinbefore described invention which nucleotides are preferably linked from the base to a detectable label by a cleavable linker and wherein the detectable label linked to each type of nucleotide can be distinguished upon detection from the detectable label used for other types of nucleotides;

(b) incorporating the nucleotide into the complement of the target single-stranded polynucleotide;

(c) detecting the label of the nucleotide of (b), thereby determining the type of nucleotide incorporated;

(d) removing the label of the nucleotide of (b) and the blocking group; and

(e) optionally repeating steps (b)-(d) one or more times;

thereby determining the sequence of a target single-stranded polynucleotide.
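The cycle of steps (a) to (e) above can be pictured with a toy model. The sketch below is illustrative only and makes several assumptions not found in the text: the label names, the dictionary representation of a nucleotide and the function name `sequence_by_synthesis` are all hypothetical, and every incorporation, detection and deprotection is treated as perfectly efficient.

```python
# Toy model of the cycle in steps (a)-(e); it is not a chemical simulation.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

# (a) Hypothetical label assignment: one distinguishable label per nucleotide type.
LABELS = {"A": "label_1", "C": "label_2", "G": "label_3", "T": "label_4"}
LABEL_TO_BASE = {label: base for base, label in LABELS.items()}


def sequence_by_synthesis(template, cycles=None):
    """Return the bases called for the strand complementary to `template`.

    `template` is given in the order in which the polymerase meets its bases.
    """
    called = []
    n_cycles = len(template) if cycles is None else min(cycles, len(template))
    for position in range(n_cycles):
        # (b) Incorporate the blocked, labelled nucleotide that pairs with the
        # next unpaired template base; the 3' block prevents further extension.
        base = COMPLEMENT[template[position]]
        incorporated = {"base": base, "label": LABELS[base], "blocked": True}
        # (c) Detect the label and so identify the type of nucleotide.
        called.append(LABEL_TO_BASE[incorporated["label"]])
        # (d) Remove the label and the blocking group, freeing the 3'-OH.
        incorporated["label"] = None
        incorporated["blocked"] = False
        # (e) The loop repeats steps (b)-(d) for the next position.
    return "".join(called)
```

Called on the template "ACGT", this model reports "TGCA", one base per cycle. The essential point it illustrates is that a single blocked nucleotide is added and read in each cycle before the block is removed.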

Additionally, in another aspect, the invention provides a kit, comprising:

(a) a plurality of different individual nucleotides of the invention; and

(b) packaging materials therefor.

The nucleosides or nucleotides according to or used in the methods of the present invention comprise a purine or pyrimidine base and a ribose or deoxyribose sugar moiety which has a blocking group covalently attached thereto, preferably at the 3′O position, which renders the molecules useful in techniques requiring blocking of the 3′-OH group to prevent incorporation of additional nucleotides, such as for example in sequencing reactions, polynucleotide synthesis, nucleic acid amplification, nucleic acid hybridisation assays, single nucleotide polymorphism studies, and other such techniques.

Where the term “blocking group” is used herein in the context of the invention, this embraces both the allyl and “Z” blocking groups described herein. However, it will be appreciated that, in the methods of the invention as described and claimed herein, where mixtures of nucleotides are used, these very preferably each comprise the same type of blocking, i.e. allyl-blocked or “Z”-blocked. Where “Z”-blocked nucleotides are used, each “Z” group will generally be the same group, except in those cases where the detectable label forms part of the “Z” group, i.e. is not attached to the base.

Once the blocking group has been removed, it is possible to incorporate another nucleotide to the free 3′-OH group.

The molecule can be linked via the base to a detectable label by a desirable linker, which label may be a fluorophore, for example. The detectable label may instead, if desirable, be incorporated into the blocking groups of formula “Z”. The linker can be acid labile, photolabile or contain a disulfide linkage. Other linkages, in particular phosphine-cleavable azide-containing linkers, may be employed in the invention as described in greater detail.

Preferred labels and linkages include those disclosed in WO 03/048387.

In the methods where nucleotides are incorporated, e.g. where the incorporation of a nucleotide molecule complementary to the nucleotide in a target single stranded polynucleotide is controlled in a synthesis or sequencing reaction of the invention, the incorporation of the molecule may be accomplished via a terminal transferase, a polymerase or a reverse transcriptase.

Preferably, the molecule is incorporated by a polymerase and particularly from Thermococcus sp., such as 9° N. Even more preferably, the polymerase is a mutant 9° N A485L and even more preferably is a double mutant Y409V and A485L.

In the methods for determining the sequence of a target single-stranded polynucleotide comprising monitoring the sequential incorporation of complementary nucleotides of the invention, it is preferred that the blocking group and the label may be removed in a single chemical treatment step. Thus, in a preferred embodiment of the invention, the blocking group is cleaved simultaneously with the label. This will of course be a feature inherent to those blocking groups of formula Z which incorporate a detectable label.

Furthermore, preferably the blocked and labelled modified nucleotide constructs of the nucleotide bases A, T, C and G are recognised as substrates by the same polymerase enzyme.

In the methods described herein, each of the nucleotides can be brought into contact with the target sequentially, with removal of non-incorporated nucleotides prior to addition of the next nucleotide, where detection and removal of the label and the blocking group is carried out either after addition of each nucleotide, or after addition of all four nucleotides.

In the methods, all of the nucleotides can be brought into contact with the target simultaneously, i.e., a composition comprising all of the different nucleotides is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and the blocking group.

The methods can comprise a first step and a second step, where in the first step, a first composition comprising two of the four types of modified nucleotides is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and the blocking group, and where in the second step, a second composition comprising the two nucleotides not included in the first composition is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group, and where the first step and the second step can be optionally repeated one or more times.

The methods described herein can also comprise a first step and a second step, where in the first step, a composition comprising one of the four nucleotides is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group, and where in the second step, a second composition, comprising the three nucleotides not included in the first composition, is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group, and where the first step and the second step can be optionally repeated one or more times.

The methods described herein can also comprise a first step and a second step, where in the first step, a first composition comprising three of the four nucleotides is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group, and where in the second step, a composition comprising the nucleotide not included in the first composition is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group, and where the first step and the second step can be optionally repeated one or more times.

The incorporating step in the methods of the invention can be accomplished via a terminal transferase, a polymerase or a reverse transcriptase as hereinbefore defined. The detectable label and/or the cleavable linker can be of a size sufficient to prevent the incorporation of a second nucleotide or nucleoside into the nucleic acid molecule.

In certain methods described herein for determining the sequence of a target single-stranded polynucleotide, each of the four nucleotides, one of which will be complementary to the first unpaired base in the target polynucleotide, can be brought into contact with the target sequentially, optionally with removal of non-incorporated nucleotides prior to addition of the next nucleotide. Determination of the success of the incorporation may be carried out either after provision of each nucleotide, or after the addition of all of the nucleotides added. If it is determined after addition of fewer than four nucleotides that one has been incorporated, it is not necessary to provide further nucleotides in order to detect the nucleotides complementary to the incorporated nucleotide.

Alternatively, all of the nucleotides can be brought into contact with the target simultaneously, i.e., a composition comprising all of the different nucleotides (i.e. A, T, C and G or A, U, C and G) is brought into contact with the target, and non-incorporated nucleotides removed prior to detection and removal of the label(s). The methods involving sequential addition of nucleotides may comprise a first substep and optionally one or more subsequent substeps. In the first substep a composition comprising one, two or three of the four possible nucleotides is provided to, i.e. brought into contact with, the target. Thereafter any unincorporated nucleotides may be removed and a detecting step may be conducted to determine whether one of the nucleotides has been incorporated. If one has been incorporated, the cleavage of the linker may be effected. In this way the identity of a nucleotide in the target polynucleotide may be determined. The nascent polynucleotide may then be extended to determine the identity of the next unpaired nucleotide in the target oligonucleotide.

If the first substep above does not lead to incorporation of a nucleotide, or if this is not known, since the presence of incorporated nucleotides is not sought immediately after the first substep, one or more subsequent substeps may be conducted in which some or all, of those nucleotides not provided in the first substep are provided either, as appropriate, simultaneously or subsequently. Thereafter any unincorporated nucleotides may be removed and a detecting step conducted to determine whether one of the classes of nucleotide has been incorporated. If one has been incorporated, cleavage of the linker may be effected. In this way the identity of a nucleotide in the target polynucleotide may be determined. The nascent polynucleotide may then be extended to determine the identity of the next unpaired nucleotide in the target oligonucleotide. If necessary, a third and optionally a fourth substep may be effected in a similar manner to the second substep. Obviously, once four substeps have been effected, all four possible nucleotides will have been provided and one will have been incorporated.
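The substep scheme described above amounts to offering compositions of nucleotides in turn until a detection step shows that one has been incorporated. The following sketch is illustrative only: the function name `identify_next_base`, the use of Python sets for compositions and the assumption of perfectly specific incorporation are introduced here and are not part of the described methods.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def identify_next_base(template_base, compositions):
    """Offer nucleotide compositions one substep at a time.

    Returns (incorporated_nucleotide, substeps_used) and stops as soon as a
    detection step shows that an incorporation has taken place.
    """
    wanted = COMPLEMENT[template_base]
    for substep, composition in enumerate(compositions, start=1):
        # Provide the composition; in this idealised model only the
        # complementary nucleotide can be incorporated.
        incorporated = wanted if wanted in composition else None
        # Unincorporated nucleotides are removed, then detection is carried out.
        if incorporated is not None:
            return incorporated, substep
    raise ValueError("no composition contained the complementary nucleotide")
```

With one nucleotide per substep, `identify_next_base("A", [{"A"}, {"C"}, {"G"}, {"T"}])` needs all four substeps before T is incorporated, whereas a two-plus-two split such as `[{"A", "T"}, {"C", "G"}]` finds it in the first. This illustrates why detecting after each composition can avoid providing the remaining nucleotides.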

It is desirable to determine whether a type or class of nucleotide has been incorporated after any particular combination comprising one, two or three nucleotides has been provided. In this way the unnecessary cost and time expended in providing the other nucleotide(s) is obviated. This is not a required feature of the invention, however.

It is also desirable, where the method for sequencing comprises one or more substeps, to remove any unincorporated nucleotides before further nucleotides are provided. Again, this is not a required feature of the invention. Obviously, it is necessary that at least some and preferably as many as practicable of the unincorporated nucleotides are removed prior to the detection of the incorporated nucleotide.

The kits of the invention include: (a) individual nucleotides according to the hereinbefore described invention, where each nucleotide has a base that is linked to a detectable label via a cleavable linker, or a detectable label linked via an optionally cleavable linker to a blocking group of formula Z, and where the detectable label linked to each nucleotide can be distinguished upon detection from the detectable label used for the other three nucleotides; and (b) packaging materials therefor. The kit can further include an enzyme for incorporating the nucleotide into the complementary nucleotide chain and buffers appropriate for the action of the enzyme in addition to appropriate chemicals for removal of the blocking group and the detectable label, which can preferably be removed by the same chemical treatment step.

The nucleotides/nucleosides are suitable for use in many different DNA-based methodologies, including DNA synthesis and DNA sequencing protocols.

The invention may be understood with reference to the attached drawings in which:

FIG. 1 shows exemplary nucleotide structures useful in the invention. For each structure, X can be H, phosphate, diphosphate or triphosphate. R₁ and R₂ can be the same or different, and can be selected from H, OH, or any group which can be transformed into an OH, including, but not limited to, a carbonyl. Some suitable functional groups for R₁ and R₂ include the structures shown in FIG. 3 and FIG. 4.

FIG. 2 shows structures of linkers useful in certain aspects of the invention, including (1) disulfide linkers and acid labile linkers, (2) dialkoxybenzyl linkers, (3) Sieber linkers, (4) indole linkers and (5) t-butyl Sieber linkers.

FIG. 3 shows some functional molecules useful in the invention, including some cleavable linkers and some suitable hydroxyl protecting groups. In these structures, R₁ and R₂ may be the same or different, and can be H, OH, or any group which can be transformed into an OH group, including a carbonyl. R₃ represents one or more substituents independently selected from alkyl, alkoxyl, amino or halogen groups. R₄ and R₅ can be H or alkyl, and R₆ can be alkyl, cycloalkyl, alkenyl, cycloalkenyl or benzyl. X can be H, phosphate, diphosphate or triphosphate.

FIG. 4 is a schematic illustration of some of the Z blocking groups that can be used according to the invention.

FIG. 5 shows two cycles of incorporation of labelled and blocked dGTP, dCTP and dATP respectively (compounds 18, 24 and 32).

FIG. 6 shows six cycles of incorporation of labelled and blocked dTTP (compound 6).

FIG. 7 shows the effective blocking by compound 38 (a 3′-O-allyl nucleotide of the invention).

The present invention relates to nucleotide or nucleoside molecules that are modified by the reversible covalent attachment of a 3′-OH blocking group thereto, and which molecules may be used in reactions where blocked nucleotide or nucleoside molecules are required, such as in sequencing reactions, polynucleotide synthesis and the like.

Where the blocking group is an allyl group, it may be introduced into the 3′-position using standard literature procedures such as that used by Metzker (supra).

The allyl groups are removed by reacting in aqueous solution a compound of formula R—O-allyl, R₂N(allyl), RNH(allyl), RN(allyl)₂ or R—S-allyl (wherein R is a water-soluble biological molecule) with a transition metal, wherein said transition metal is capable of forming a metal allyl complex, in the presence of one or more ligands selected from the group comprising water-soluble phosphine and water-soluble mixed nitrogen-phosphine ligands.

The water-soluble biological molecule is not particularly restricted provided, of course, it contains one or more hydroxyl, acid, amino, amide or thiol functionalities protected with an allyl group. Allyl esters are examples of compounds of formula R—O-allyl. Preferred functionalities are hydroxyl and amino.

As used herein the term biological molecule is used to embrace any molecules or class of molecule which performs a biological role. Such molecules include for example, polynucleotides such as DNA and RNA, oligonucleotides and single nucleotides. In addition, peptides and peptide mimetics, such as enzymes and hormones etc., are embraced by the invention. Compounds which comprise a secondary amide linkage, such as peptides, or a secondary amine, where such compounds are allylated on the nitrogen atom of the secondary amine or amide, are examples of compounds of formula R₂N(allyl) in which both R groups belong to the same biological molecule. Particularly preferred compounds however are polynucleotides, (including oligonucleotides) and nucleotides and nucleosides, preferably those which contain one base to which is attached a detectable label linked through a cleavable linker. Such compounds are useful in the determination of sequences of oligonucleotides as described herein.

Transition metals of use in the invention are any which may form metal allyl complexes, for example platinum, palladium, rhodium, ruthenium, osmium and iridium. Palladium is preferred.

The transition metal, e.g. palladium, is conveniently introduced as a salt, e.g. as a halide. Mixed salts such as Na₂PdCl₄ may also be used. Other appropriate salts and compounds will be readily determined by the skilled person and are commercially available, e.g. from Aldrich Chemical Company.

Suitable ligands are any phosphine or mixed nitrogen-phosphine ligands known to those skilled in the art, characterised in that the ligands are derivatised so as to render them water-soluble, e.g. by introducing one or more sulfonate, amine, hydroxyl (preferably a plurality of hydroxyl) or carboxylate residues. Where amine residues are present, formation of amine salts may assist the solubilisation of the ligand and thus the metal-allyl complex. Examples of appropriate ligands are triaryl phosphines, e.g. triphenyl phosphine, derivatised so as to make them water-soluble. Also preferred are trialkyl phosphines, e.g. tri-C₁₋₆-alkyl phosphines such as triethyl phosphines; such trialkyl phosphines are likewise derivatised so as to make them water-soluble. Sulfonate-containing and carboxylate-containing phosphines are particularly preferred; an example of the former is 3,3′,3″-phosphinidynetris(benzenesulfonic acid), which is commercially available from Aldrich Chemical Company as the trisodium salt; and a preferred example of the latter is tris(2-carboxyethyl)phosphine, which is available from Aldrich as the hydrochloride salt.

The derivatised water-soluble phosphines and nitrogen-containing phosphines described herein may be used as their salts (e.g. as the hydrochloride or sodium salts) or, for example, in the case of the sulfonic and carboxylic acid-containing phosphines described herein, as the free acids. Thus 3,3′,3″-phosphinidynetris(benzenesulfonic acid) and tris(2-carboxyethyl)phosphine may be introduced either as the triacids or as the trisodium salts. Other appropriate salts will be evident to those skilled in the art. The existence in salt form is not particularly important provided the phosphines are soluble in aqueous solution.

Other ligands which may be used include the following:

The skilled person will be aware that the atoms chelated to the transition metal in the water-soluble complex may be part of mono- or polydentate ligands. Some such polydentate ligands are shown above. Whilst monodentate ligands are preferred, the invention thus also embraces methods which use bi-, tri-, tetra-, penta- and hexadentate water-soluble phosphine and water-soluble nitrogen-containing phosphine ligands.

The various aspects of the invention relating to allyl blocking groups are of particular utility in sequencing polynucleotides wherein the 3′-OH is allylated. However, when present, the 2′-OH is equally amenable to allylation, and to deprotection according to the method of the invention if necessary. In fact any allylated alcohol may be deprotected according to the method of the invention. Preferred allylated alcohols, however, are those derived from primary and secondary alcohols. Particularly preferred are allylated nucleosides and nucleotides as described herein. It is possible to deprotect tertiary allylated alcohols; the reaction is simply slower (although such deprotections, and other deprotections of this invention, may be accelerated if necessary by heating the solution, e.g. to 40° C., preferably 50° C. or higher such as approximately 60° C. or even up to 80° C.).

It is also possible to deprotect allylated primary or secondary amines and allylated thiols.

As noted earlier, the aqueous solution in which allyl deprotection is effected need not be 100% water (as the continuous phase). However, substantially pure water (e.g. at least 98 vol %, preferably about 100 vol %) is preferred. Cosolvents are generally not required although they can assist in the solubilisation of the allylated substrate for the deallylation. Generally, biomolecules are readily soluble in water (e.g. pure water) in which the deprotection reaction described herein may be effected. If desirable, one or more water-miscible cosolvents may be employed. Appropriate solvents include acetonitrile, dimethylsulfoxide, methanol, ethanol and acetone, methanol being preferred. Less preferred solvents include tetrahydrofuran (THF) and dioxane.

In the method of allyl deprotection according to the invention, a soluble metal complex is formed comprising a transition metal and one or more water-soluble phosphine and water-soluble nitrogen-containing phosphine ligands. More than one type of water-soluble phosphine/nitrogen-containing phosphine ligand may be used in a deallylation reaction although generally only one type of these classes of ligand will be used in a given reaction. We believe the deallylation reaction to be catalytic. Accordingly, the quantity of transition metal, e.g. palladium, may be less than 1 mol % (calculated relative to the allyl-protected compound to be deprotected). Advantageously the amount of catalyst may be much less than 1 mol %, e.g. <0.50 mol %, preferably <0.10 mol %, particularly <0.05 mol %. Even lower quantities of metal may be used, for example <0.03 or even <0.01 mol %. As those skilled in the art will be aware, however, as the quantity of catalyst is reduced, so too is the speed of the reaction. The skilled person will be able to judge, in any instance, the precise quantity of transition metal, and thus catalyst, best suited to any particular deallylation reaction.

In contrast to the amount of metal required in forming the active catalyst, the quantity of water-soluble phosphorus-containing ligand(s) used must be greater than 1 molar equivalent (again calculated relative to the allyl-protected compound to be deprotected). Preferably greater than 4, e.g. greater than 6, for example 8-12 molar equivalents of ligand may be used. Even higher quantities of ligand e.g. >20 mole equivalents may be used if desired.

The skilled person will be able to determine the quantity of ligand best suited to any individual reaction.
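By way of illustration only, the arithmetic implied by the quantities discussed above (metal expressed in mol % and ligand in molar equivalents, both relative to the allyl-protected compound) may be sketched in Python as follows. The function name, parameter names, default values and the 100 µmol example are assumptions introduced for this sketch and are not taken from the description.

```python
def deallylation_loadings(substrate_umol, metal_mol_percent=0.5, ligand_equiv=10.0):
    """Return (metal_umol, ligand_umol) for a given amount of allyl-protected substrate.

    Both loadings are expressed relative to the substrate, as in the text:
    the metal as mol %, the ligand as molar equivalents.
    The default values are illustrative assumptions only.
    """
    if substrate_umol <= 0:
        raise ValueError("substrate amount must be positive")
    if ligand_equiv <= 1.0:
        # The text requires more than 1 molar equivalent of ligand.
        raise ValueError("ligand must exceed 1 molar equivalent of substrate")
    metal_umol = substrate_umol * metal_mol_percent / 100.0
    ligand_umol = substrate_umol * ligand_equiv
    return metal_umol, ligand_umol


# Hypothetical example: 100 umol of substrate, 0.05 mol % metal, 8 equivalents of ligand.
metal, ligand = deallylation_loadings(100.0, metal_mol_percent=0.05, ligand_equiv=8.0)
print(f"metal: {metal:.3f} umol, ligand: {ligand:.1f} umol")
```

The sketch makes plain that the ligand is used in large excess over the metal: in the hypothetical example the two amounts differ by more than four orders of magnitude.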

Where the blocking group is any of —C(R′)₂—O—R″, —C(R′)₂—N(R″)₂, —C(R′)₂—N(H)R″, —C(R′)₂—S—R″ and —C(R′)₂—F, i.e. of formula Z, each R′ may be independently H or an alkyl

The intermediates produced advantageously spontaneously dissociate under aqueous conditions back to the natural 3′ hydroxy structure, which permits further incorporation of another nucleotide. Any appropriate protecting group may be used, as discussed herein. Preferably, Z is of formula —C(R′)₂—O—R″, —C(R′)₂—N(R″)₂, —C(R′)₂—N(H)R″ or —C(R′)₂—SR″. Particularly preferably, Z is of formula —C(R′)₂—O—R″, —C(R′)₂—N(R″)₂ or —C(R′)₂—SR″. R″ may be a benzyl group or a substituted benzyl group.

One example of groups of structure —O-Z wherein Z is —C(R′)₂—N(R″)₂ are those in which —N(R″)₂ is azido (—N₃). One preferred such example is azidomethyl wherein each R′ is H. Alternatively, R′ in Z groups of formula —C(R′)₂—N₃ and other Z groups may be any of the other groups discussed herein.

Examples of typical R′ groups include C₁₋₆ alkyl, particularly methyl and ethyl, and the following (in which each structure shows the bond which connects the R′ moiety to the carbon atom to which it is attached in the Z groups; the asterisks (*) indicate the points of attachment):

(wherein each R is an optionally substituted C₁₋₁₀ alkyl group, an optionally substituted alkoxy group, a halogen atom or functional group such as hydroxyl, amino, cyano, nitro, carboxyl and the like) and “Het” is a heterocyclic (which may for example be a heteroaryl group). These R′ groups shown above are preferred where the other R′ group is the same as the first or is hydrogen. Preferred Z groups are of formula C(R′)₂N₃ in which the R′ groups are selected from the structures given above and hydrogen; or in which (R′)₂ represents an alkylidene group of formula ═C(R′″)₂, e.g. ═C(Me)₂.

Where molecules contain Z groups of formula C(R′)₂N₃, the azido group may be converted to amino by contacting such molecules with the phosphine or nitrogen-containing phosphine ligands described in detail in connection with the transition metal complexes which serve to cleave the allyl groups from compounds of formula PN—O-allyl, R—O-allyl, R₂N(allyl), RNH(allyl), RN(allyl)₂ and R—S-allyl. When transforming azido to amino, however, no transition metal is necessary. Alternatively, the azido group in Z groups of formula C(R′)₂N₃ may be converted to amino by contacting such molecules with thiols, in particular water-soluble thiols such as dithiothreitol (DTT).

Where an R′ group represents a detectable label attached through a linking group, the other R′ group or any other part of “Z” will generally not contain a detectable label, nor will the base of the nucleoside or nucleotide contain a detectable label. Appropriate linking groups for connecting the detectable label to the 3′-blocking group will be known to the skilled person and examples of such groups are described in greater detail hereinafter.

Exemplary of linkages in R′ groups containing detectable labels are those which contain one or more amide bonds. Such linkers may also contain an arylene, e.g. phenylene, group in the chain (i.e. a linking moiety —Ar— where the phenyl ring is part of the linker by way of its 1,4-disposed carbon atoms). The phenyl ring may be substituted at its non-bonded positions with one or more substituents such as alkyl, hydroxyl, alkyloxy, halide, nitro, carboxyl or cyano and the like, particularly electron-withdrawing groups, which withdraw electrons either by induction or by resonance. The linkage in the R′ group may also include moieties such as —O—, S(O)q (wherein q is 0, 1 or 2), NH or N-alkyl. Examples of such Z groups are as follows:

(wherein EWG stands for electron-withdrawing group; n is an integer of from 1 to 50, preferably 2-20, e.g. 3 to 10; and fluor indicates a fluorophore). An example of an electron-withdrawing group by resonance is nitro; a group which acts through induction is fluoro. The skilled person will be aware of other appropriate electron-withdrawing groups. In addition, it will be understood that whilst a fluorophore is indicated as being the detectable label present, other detectable groups as discussed in greater detail hereinafter may be included instead.

Where a detectable label is attached to a nucleotide at the 3′-blocking position, the linker need not be cleavable to have utility in those reactions, such as DNA sequencing, described herein which require the label to be “read” and removed before the next step of the reaction. This is because the label, when attached to the 3′-block, will become separated from the nucleotide when the intermediate compounds described herein collapse so as to replace the “Z” group with a hydrogen atom. As noted above, each R″ is or is part of a removable protecting group. In an alternative embodiment, R″ may be a benzyl group or a substituted benzyl group.

It will be appreciated that where it is possible to incorporate a detectable label onto a group R″, the invention embraces this possibility. Thus, where R″ is a benzyl group, the phenyl ring may bear a linker group to which is attached a fluorophore or other detectable group. Introduction of such groups does not prevent the ability to remove such R″s and they do not prevent the generation of the desired unstable intermediates during deprotection of blocking groups of formula Z.

As is known in the art, a “nucleotide” consists of a nitrogenous base, a sugar, and one or more phosphate groups. Nucleotides are the monomeric units of a nucleic acid sequence. In RNA, the sugar is a ribose, and in DNA a deoxyribose, i.e. a sugar lacking a hydroxyl group that is present in ribose. The nitrogenous base is a derivative of purine or pyrimidine. The purines are adenine (A) and guanine (G), and the pyrimidines are cytosine (C) and thymine (T) (or in the context of RNA, uracil (U)). The C-1 atom of deoxyribose is bonded to N-1 of a pyrimidine or N-9 of a purine. A nucleotide is also a phosphate ester of a nucleoside, with esterification occurring on the hydroxyl group attached to C-5 of the sugar. Nucleotides are usually mono-, di- or triphosphates.

A “nucleoside” is structurally similar to a nucleotide, but is missing the phosphate moieties. An example of a nucleoside analogue would be one in which the label is linked to the base and there is no phosphate group attached to the sugar molecule.

Although the base is usually referred to as a purine or pyrimidine, the skilled person will appreciate that derivatives and analogues are available which do not alter the capability of the nucleotide or nucleoside to undergo Watson-Crick base pairing. “Derivative” or “analogue” means a compound or molecule whose core structure is the same as, or closely resembles that of, a parent compound, but which has a chemical or physical modification, such as a different or additional side group, or 2′ and/or 3′ blocking groups, which allows the derivative nucleotide or nucleoside to be linked to another molecule. For example, the base can be a deazapurine. The derivatives should be capable of undergoing Watson-Crick pairing. “Derivative” and “analogue” also mean a synthetic nucleotide or nucleoside derivative having modified base moieties and/or modified sugar moieties. Such derivatives and analogs are discussed in, e.g., Scheit, Nucleotide Analogs (John Wiley & Sons, 1980) and Uhlmann et al., Chemical Reviews 90:543-584, 1990. Nucleotide analogs can also comprise modified phosphodiester linkages, including phosphorothioate, phosphorodithioate, alkyl-phosphonate, phosphoranilidate and phosphoramidate linkages. The analogs should be capable of undergoing Watson-Crick base pairing. “Derivative”, “analog” and “modified”, as used herein, may be used interchangeably, and are encompassed by the terms “nucleotide” and “nucleoside” defined herein.

In the context of the present invention, the term “incorporating” means becoming part of a nucleic acid (e.g. DNA) molecule or oligonucleotide or primer. An oligonucleotide refers to a synthetic or natural molecule comprising a covalently linked sequence of nucleotides which are joined by a phosphodiester or modified phosphodiester bond between the 3′ position of the pentose on one nucleotide and the 5′ position of the pentose on an adjacent nucleotide.

The term “alkyl” covers straight chain, branched chain and cycloalkyl groups. Unless the context indicates otherwise, the term “alkyl” refers to groups having 1 to 10 carbon atoms, for example 1 to 8 carbon atoms, and typically from 1 to 6 carbon atoms, for example from 1 to 4 carbon atoms. Examples of alkyl groups include methyl, ethyl, propyl, isopropyl, n-butyl, isobutyl, tert-butyl, n-pentyl, 2-pentyl, 3-pentyl, 2-methyl butyl, 3-methyl butyl, and n-hexyl and its isomers.

Examples of cycloalkyl groups are those having from 3 to 10 ring atoms, particular examples including those derived from cyclopropane, cyclobutane, cyclopentane, cyclohexane and cycloheptane, bicycloheptane and decalin.

Where alkyl (including cycloalkyl) groups are substituted, particularly where these form either or both of the R′ groups of the molecules of the invention, examples of appropriate substituents include halogen substituents or functional groups such as hydroxyl, amino, cyano, nitro, carboxyl and the like. Such groups may also be substituents, where appropriate, of the other R′ groups in the molecules of the invention.

The term “amino” refers to groups of type NR*R**, wherein R* and R** are independently selected from hydrogen and a C₁₋₆ alkyl group (the substituted groups also being referred to as C₁₋₆ alkylamino or di-C₁₋₆ alkylamino).

The term “halogen” as used herein includes fluorine, chlorine, bromine and iodine.

The nucleotide molecules of the present invention are suitable for use in many different methods where the detection of nucleotides is required.

DNA sequencing methods, such as those outlined in U.S. Pat. No. 5,302,509 can be carried out using the nucleotides.

The present invention can make use of conventional detectable labels. Detection can be carried out by any suitable method, including fluorescence spectroscopy or by other optical means. The preferred label is a fluorophore, which, after absorption of energy, emits radiation at a defined wavelength. Many suitable fluorescent labels are known. For example, Welch et al. (Chem. Eur. J. 5(3):951-960, 1999) discloses dansyl-functionalised fluorescent moieties that can be used in the present invention. Zhu et al. (Cytometry 28:206-211, 1997) describes the use of the fluorescent labels Cy3 and Cy5, which can also be used in the present invention. Labels suitable for use are also disclosed in Prober et al. (Science 238:336-341, 1987); Connell et al. (BioTechniques 5(4):342-384, 1987), Ansorge et al. (Nucl. Acids Res. 15(11):4593-4602, 1987) and Smith et al. (Nature 321:674, 1986). Other commercially available fluorescent labels include, but are not limited to, fluorescein, rhodamine (including TMR, Texas Red and Rox), Alexa, BODIPY, acridine, coumarin, pyrene, benzanthracene and the cyanines.

Multiple labels can also be used in the invention. For example, bi-fluorophore FRET cassettes (Tet. Let. 46:8867-8871, 2000) are well known in the art and can be utilised in the present invention. Multi-fluor dendrimeric systems (J. Amer. Chem. Soc. 123:8101-8108, 2001) can also be used.

Although fluorescent labels are preferred, other forms of detectable labels will be apparent as useful to those of ordinary skill. For example, microparticles, including quantum dots (Empedocles et al., Nature 399:126-130, 1999), gold nanoparticles (Reichert et al., Anal. Chem. 72:6025-6029, 2000) and microbeads (Lacoste et al., Proc. Natl. Acad. Sci. USA 97(17):9461-9466, 2000) can all be used.

Multi-component labels can also be used in the invention. A multi-component label is one which is dependent on the interaction with a further compound for detection. The most common multi-component label used in biology is the biotin-streptavidin system. Biotin is used as the label attached to the nucleotide base. Streptavidin is then added separately to enable detection to occur. Other multi-component systems are available. For example, dinitrophenol has a commercially available fluorescent antibody that can be used for detection.

The invention has been and will be further described with reference to nucleotides. However, unless indicated otherwise, the reference to nucleotides is also intended to be applicable to nucleosides. The invention will also be further described with reference to DNA, although the description will also be applicable to RNA, PNA, and other nucleic acids, unless otherwise indicated.

The modified nucleotides of the invention may use a cleavable linker to attach the label to the nucleotide. The use of a cleavable linker ensures that the label can, if required, be removed after detection, avoiding any interfering signal with any labelled nucleotide incorporated subsequently.

Generally, the use of cleavable linkers is preferable, particularly in the methods of the invention hereinbefore described except where the detectable label is attached to the nucleotide by forming part of the “Z” group.

Those skilled in the art will be aware of the utility of dideoxynucleoside triphosphates in so-called Sanger sequencing methods, and related protocols (Sanger-type), which rely upon randomised chain-termination at a particular type of nucleotide. An example of a Sanger-type sequencing protocol is the BASS method described by Metzker (infra). Other Sanger-type sequencing methods will be known to those skilled in the art.

Sanger and Sanger-type methods generally operate by conducting an experiment in which eight types of nucleotides are provided, four of which contain a 3′OH group, and four of which omit the OH group and are labeled differently from each other. The nucleotides which omit the 3′OH group (dideoxy nucleotides) are conventionally abbreviated to ddNTPs. As is known by the skilled person, since the ddNTPs are labeled differently, by determining the positions of the terminal nucleotides incorporated, and combining this information, the sequence of the target oligonucleotide may be determined.
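The logic of combining terminal-nucleotide positions into a sequence may be illustrated by the following minimal Python sketch. It is a toy model only: the function names, the fixed random seed and the example template are assumptions made for illustration, and the model assumes that exactly one terminated fragment is recovered for every position of the synthesised strand.

```python
import random

# Watson-Crick pairing for DNA
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def terminated_fragments(template, seed=0):
    """Return an unordered list of (fragment_length, terminal_label) pairs.

    `template` is read in the direction in which the new strand is synthesised.
    Each fragment ends where a labelled chain terminator was incorporated,
    so its label reports the base at that position of the new strand.
    """
    synthesised = [COMPLEMENT[base] for base in template]
    fragments = [(i + 1, base) for i, base in enumerate(synthesised)]
    # Fragments are obtained as a mixed population, so shuffle them here.
    random.Random(seed).shuffle(fragments)
    return fragments


def read_sequence(fragments):
    """Order fragments by length and read off the terminal labels."""
    return "".join(label for _, label in sorted(fragments))


fragments = terminated_fragments("TACGGT")
print(read_sequence(fragments))
```

Sorting by fragment length stands in for the size separation step; the terminal label at each length then gives the base at that position of the synthesised strand.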

The nucleotides of the present invention, it will be recognized, may be of utility in Sanger methods and related protocols since the same effect achieved by using ddNTPs may be achieved by using the novel 3′-OH blocking groups described herein: both prevent incorporation of subsequent nucleotides.

The use of the nucleotides according to the present invention in Sanger and Sanger-type sequencing methods, wherein the linker connecting the detectable label to the nucleotide may or may not be cleavable, forms a still further aspect of this invention. Viewed from this aspect, the invention provides the use of such nucleotides in a Sanger or a Sanger-type sequencing method.

Where 3′-OH Z-blocked nucleotides according to the present invention are used, it will be appreciated that the detectable labels attached to the nucleotides need not be connected via cleavable linkers, since in each instance where a labelled nucleotide of the invention is incorporated, no nucleotides need to be subsequently incorporated and thus the label need not be removed from the nucleotide.

Moreover, it will be appreciated that monitoring of the incorporation of 3′OH blocked nucleotides may be determined by use of radioactive ³²P in the phosphate groups attached. These may be present in either the ddNTPs themselves or in the primers used for extension. Where the blocking groups are of formula “Z”, this represents a further aspect of the invention.

Viewed from this aspect, the invention provides the use of a nucleotide having a 3′OH group blocked with a “Z” group in a Sanger or a Sanger-type sequencing method. In this embodiment, a ³²P detectable label may be present either in the ddNTPs used or in the primer used for extension.

Cleavable linkers are known in the art, and conventional chemistry can be applied to attach a linker to a nucleotide base and a label. The linker can be cleaved by any suitable method, including exposure to acids, bases, nucleophiles, electrophiles, radicals, metals, reducing or oxidising agents, light, temperature, enzymes etc. The linker as discussed herein may also be cleaved with the same catalyst used to cleave the 3′O-blocking group bond. Suitable linkers can be adapted from standard chemical blocking groups, as disclosed in Greene & Wuts, Protective Groups in Organic Synthesis, John Wiley & Sons. Further suitable cleavable linkers used in solid-phase synthesis are disclosed in Guillier et al. (Chem. Rev. 100:2092-2157, 2000).

The use of the term “cleavable linker” is not meant to imply that the whole linker is required to be removed from e.g., the nucleotide base. Where the detectable label is attached to the base, the nucleoside cleavage site can be located at a position on the linker that ensures that part of the linker remains attached to the nucleotide base after cleavage.

Where the detectable label is attached to the base, the linker can be attached at any position on the nucleotide base provided that Watson-Crick base pairing can still be carried out. In the context of purine bases, it is preferred if the linker is attached via the 7-position of the purine or the preferred deazapurine analogue, via an 8-modified purine, via an N-6 modified adenosine or an N-2 modified guanine. For pyrimidines, attachment is preferably via the 5-position on cytosine, thymidine or uracil and the N-4 position on cytosine. Suitable nucleotide structures are shown in FIG. 1. For each structure in FIG. 1 X can be H, phosphate, diphosphate or triphosphate. R₁ and R₂ can be the same or different, and are selected from H, OH, O-allyl, or formula Z as described herein or any other group which can be transformed into an OH, including, but not limited to, a carbonyl, provided that at least one of R₁ and R₂ is O-allyl or formula Z as described herein. Some suitable functional groups for R₁ and R₂ include the structures shown in FIGS. 3 and 4.

Suitable linkers are shown in FIG. 3 and include, but are not limited to, disulfide linkers (1), acid labile linkers (2, 3, 4 and 5; including dialkoxybenzyl linkers (e.g., 2), Sieber linkers (e.g., 3), indole linkers (e.g., 4), t-butyl Sieber linkers (e.g., 5)), electrophilically cleavable linkers, nucleophilically cleavable linkers, photocleavable linkers, linkers cleavable under reductive or oxidative conditions, safety-catch linkers, and linkers cleaved by elimination mechanisms.

A. Electrophilically Cleaved Linkers.

Electrophilically cleaved linkers are typically cleaved by protons and include cleavages sensitive to acids. Suitable linkers include the modified benzylic systems such as trityl, p-alkoxybenzyl esters and p-alkoxybenzyl amides. Other suitable linkers include tert-butyloxycarbonyl (Boc) groups and the acetal system.

The use of thiophilic metals, such as nickel, silver or mercury, in the cleavage of thioacetal or other sulfur-containing protecting groups can also be considered for the preparation of suitable linker molecules.

B. Nucleophilically Cleaved Linkers.

Nucleophilic cleavage is also a well recognised method in the preparation of linker molecules. Groups such as esters that are labile in water (i.e., can be cleaved simply at basic pH) and groups that are labile to non-aqueous nucleophiles, can be used. Fluoride ions can be used to cleave silicon-oxygen bonds in groups such as triisopropyl silane (TIPS) or t-butyldimethyl silane (TBDMS).

C. Photocleavable Linkers.

Photocleavable linkers have been used widely in carbohydrate chemistry. It is preferable that the light required to activate cleavage does not affect the other components of the modified nucleotides. For example, if a fluorophore is used as the label, it is preferable if this absorbs light of a different wavelength to that required to cleave the linker molecule. Suitable linkers include those based on o-nitrobenzyl compounds and nitroveratryl compounds. Linkers based on benzoin chemistry can also be used (Lee et al., J. Org. Chem. 64:3454-3460, 1999).

D. Cleavage Under Reductive Conditions

There are many linkers known that are susceptible to reductive cleavage. Catalytic hydrogenation using palladium-based catalysts has been used to cleave benzyl and benzyloxycarbonyl groups. Disulfide bond reduction is also known in the art.

E. Cleavage Under Oxidative Conditions

Oxidation-based approaches are well known in the art. These include oxidation of p-alkoxybenzyl groups and the oxidation of sulfur and selenium linkers. The use of aqueous iodine to cleave disulfides and other sulfur or selenium-based linkers is also within the scope of the invention.

F. Safety-Catch Linkers

Safety-catch linkers are those that cleave in two steps. In a preferred system the first step is the generation of a reactive nucleophilic center followed by a second step involving an intra-molecular cyclization that results in cleavage. For example, levulinic ester linkages can be treated with hydrazine or photochemistry to release an active amine, which can then be cyclised to cleave an ester elsewhere in the molecule (Burgess et al., J. Org. Chem. 62:5165-5168, 1997).

G. Cleavage by Elimination Mechanisms

Elimination reactions can also be used. For example, the base-catalysed elimination of groups such as Fmoc and cyanoethyl, and palladium-catalysed reductive elimination of allylic systems, can be used.

As well as the cleavage site, the linker can comprise a spacer unit. The spacer distances e.g., the nucleotide base from the cleavage site or label. The length of the linker is unimportant provided that the label is held a sufficient distance from the nucleotide so as not to interfere with any interaction between the nucleotide and an enzyme.

In a preferred embodiment the linker may consist of the same functionality as the block. This will make the deprotection and deblocking process more efficient, as only a single treatment will be required to remove both the label and the block.

Particularly preferred linkers are phosphine-cleavable azide containing linkers.

A method for determining the sequence of a target polynucleotide can be carried out by contacting the target polynucleotide separately with the different nucleotides to form the complement to that of the target polynucleotide, and detecting the incorporation of the nucleotides. Such a method makes use of polymerisation, whereby a polymerase enzyme extends the complementary strand by incorporating the correct nucleotide complementary to that on the target. The polymerisation reaction also requires a specific primer to initiate polymerisation.

For each cycle, the incorporation of the modified nucleotide is carried out by the polymerase enzyme, and the incorporation event is then determined. Many different polymerase enzymes exist, and it will be evident to the person of ordinary skill which is most appropriate to use. Preferred enzymes include DNA polymerase I, the Klenow fragment, DNA polymerase III, T4 or T7 DNA polymerase, Taq polymerase or Vent polymerase. Polymerases engineered to have specific properties can also be used. As noted earlier, the molecule is preferably incorporated by a polymerase, particularly one from Thermococcus sp., such as 9° N. Even more preferably, the polymerase is a mutant 9° N A485L and even more preferably is a double mutant Y409V and A485L. An example of one such preferred enzyme is Thermococcus sp. 9° N exo⁻ Y409V A485L, available from New England Biolabs. Examples of such appropriate polymerases are disclosed in Proc. Natl. Acad. Sci. USA, 1996(93), pp 5281-5285, Nucleic Acids Research, 1999(27), pp 2454-2553 and Nucleic Acids Research, 2002(30), pp 605-613.

The sequencing methods are preferably carried out with the target polynucleotide arrayed on a solid support. Multiple target polynucleotides can be immobilised on the solid support through linker molecules, or can be attached to particles, e.g., microspheres, which can also be attached to a solid support material. The polynucleotides can be attached to the solid support by a number of means, including the use of biotin-avidin interactions. Methods for immobilizing polynucleotides on a solid support are well known in the art, and include lithographic techniques and “spotting” individual polynucleotides in defined positions on a solid support. Suitable solid supports are known in the art, and include glass slides and beads, ceramic and silicon surfaces and plastic materials. The support is usually a flat surface although microscopic beads (microspheres) can also be used and can in turn be attached to another solid support by known means. The microspheres can be of any suitable size, typically in the range of from 10 nm to 100 nm in diameter. In a preferred embodiment, the polynucleotides are attached directly onto a planar surface, preferably a planar glass surface. Attachment will preferably be by means of a covalent linkage. Preferably, the arrays that are used are single molecule arrays that comprise polynucleotides in distinct optically resolvable areas, e.g., as disclosed in International Application No. WO00/06770.

The sequencing method can be carried out on both single polynucleotide molecule and multi-polynucleotide molecule arrays, i.e., arrays of distinct individual polynucleotide molecules and arrays of distinct regions comprising multiple copies of one individual polynucleotide molecule. Single molecule arrays allow each individual polynucleotide to be resolved separately. The use of single molecule arrays is preferred. Sequencing single molecule arrays non-destructively allows a spatially addressable array to be formed.

The method makes use of the polymerisation reaction to generate the complementary sequence of the target. Conditions compatible with polymerization reactions will be apparent to the skilled person.

To carry out the polymerase reaction it will usually be necessary to first anneal a primer sequence to the target polynucleotide, the primer sequence being recognised by the polymerase enzyme and acting as an initiation site for the subsequent extension of the complementary strand. The primer sequence may be added as a separate component with respect to the target polynucleotide. Alternatively, the primer and the target polynucleotide may each be part of one single stranded molecule, with the primer portion forming an intramolecular duplex with a part of the target, i.e., a hairpin loop structure. This structure may be immobilised to the solid support at any point on the molecule. Other conditions necessary for carrying out the polymerase reaction, including temperature, pH, buffer compositions etc., will be apparent to those skilled in the art.

The modified nucleotides of the invention are then brought into contact with the target polynucleotide, to allow polymerisation to occur. The nucleotides may be added sequentially, i.e., separate addition of each nucleotide type (A, T, G or C), or added together. If they are added together, it is preferable for each nucleotide type to be labelled with a different label.

This polymerisation step is allowed to proceed for a time sufficient to allow incorporation of a nucleotide.

Nucleotides that are not incorporated are then removed, for example, by subjecting the array to a washing step, and detection of the incorporated labels may then be carried out.

Detection may be by conventional means, for example if the label is a fluorescent moiety, detection of an incorporated base may be carried out by using a confocal scanning microscope to scan the surface of the array with a laser, to image a fluorophore bound directly to the incorporated base. Alternatively, a sensitive 2-D detector, such as a charge-coupled detector (CCD), can be used to visualise the individual signals generated. However, other techniques such as scanning near-field optical microscopy (SNOM) are available and may be used when imaging dense arrays. For example, using SNOM, individual polynucleotides may be distinguished when separated by a distance of less than 100 nm, e.g., 10 nm to 10 μm. For a description of scanning near-field optical microscopy, see Moyer et al., Laser Focus World 29:10, 1993. Suitable apparatus used for imaging polynucleotide arrays are known and the technical set-up will be apparent to the skilled person.

After detection, the label may be removed using suitable conditions that cleave the linker and the 3′OH block to allow for incorporation of further modified nucleotides of the invention. Appropriate conditions may be those described herein for allyl group and for “Z” group deprotections. These conditions can serve to deprotect both the linker (if cleavable) and the blocking group. Alternatively, the linker may be deprotected separately from the allyl group by employing methods of cleaving the linker known in the art (which do not sever the O-blocking group bond) followed by deprotection.
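The cycle described above (incorporation, washing, detection, then removal of the label and the 3′-OH block) can be sketched as a small simulation. This is an illustrative model only, not part of the disclosed method: the class `GrowingStrand`, the label names and the helper functions are hypothetical, strand polarity is ignored, and each step is assumed to proceed to completion.

```python
from dataclasses import dataclass
from typing import Optional

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Hypothetical assignment: one distinguishable label per nucleotide type,
# as preferred when all four nucleotides are added together.
LABELS = {"A": "label_1", "C": "label_2", "G": "label_3", "T": "label_4"}
LABEL_TO_BASE = {label: base for base, label in LABELS.items()}


@dataclass
class GrowingStrand:
    bases: str = ""               # bases incorporated so far
    blocked: bool = False         # True while the 3'-OH block is attached
    label: Optional[str] = None   # label on the most recently added base


def incorporate(strand: GrowingStrand, template: str) -> None:
    """Add the one nucleotide complementary to the next template base."""
    if strand.blocked:
        raise RuntimeError("3'-OH is blocked; deprotect before extending")
    base = COMPLEMENT[template[len(strand.bases)]]
    strand.bases += base
    strand.blocked = True         # the blocking group prevents further extension
    strand.label = LABELS[base]


def detect(strand: GrowingStrand) -> str:
    """Read the label of the incorporated nucleotide and return the base call."""
    return LABEL_TO_BASE[strand.label]


def deprotect(strand: GrowingStrand) -> None:
    """Cleave the label and remove the 3'-OH block to allow the next cycle."""
    strand.label = None
    strand.blocked = False


def sequence(template: str) -> str:
    strand = GrowingStrand()
    calls = []
    for _ in template:
        incorporate(strand, template)
        # Washing step: unincorporated nucleotides are removed (no-op here).
        calls.append(detect(strand))
        deprotect(strand)
    return "".join(calls)
```

In this model, calling `incorporate` twice without an intervening `deprotect` raises an error, which mirrors the role of the blocking group in limiting extension to one nucleotide per cycle.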

This invention may be further understood with reference to the following examples which serve to illustrate the invention and not to limit its scope.

3′-OH Protected With an Azidomethyl Group as a Protected Form of a Hemiaminal

Nucleotides bearing this blocking group at the 3′ position have been synthesised and shown to be successfully incorporated by DNA polymerases and to block efficiently; the group may subsequently be removed under neutral, aqueous conditions using water-soluble phosphines or thiols, allowing further extension:

5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyuridine (1)

To a solution of 5-iodo-2′-deoxyuridine (1.05 g, 2.96 mmol) and CuI (114 mg, 0.60 mmol) in dry DMF (21 ml) was added triethylamine (0.9 ml). After stirring for 5 min trifluoro-N-prop-2-ynyl-acetamide (1.35 g, 9.0 mmol) and Pd(PPh₃)₄ (330 mg, 0.29 mmol) were added to the mixture and the reaction was stirred at room temperature in the dark for 16 h. Methanol (MeOH) (40 ml) and bicarbonate dowex were added to the reaction mixture, which was stirred for 45 min. The mixture was filtered and the filtrate washed with MeOH and the solvent was removed under vacuum. The crude mixture was purified by chromatography on silica (ethyl acetate (EtOAc) to EtOAc:MeOH 95:5) to give slightly yellow crystals (794 mg, 71%). ¹H NMR (d₆ dimethylsulfoxide (DMSO)) δ 2.13-2.17 (m, 2H, H−2′), 3.57-3.65 (m, 2H, H−5′), 3.81-3.84 (m, 1H, H−4′), 4.23-4.27 (m, 3H, H−3′, CH₂N), 5.13 (t, J=5.0 Hz, 1H, OH), 5.20 (d, J=4.3 Hz, 1H, OH), 6.13 (t, J=6.7 Hz, 1H, H−1′), 8.23 (s, 1H, H−6), 10.11 (t, J=5.6 Hz, 1H, NH), 11.70 (br s, 1H, NH). Mass (−ve electrospray) calcd for C₁₄H₁₄F₃N₃O₆ 377.08, found 376.
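The reported yield can be checked from the quantities given. The following is a minimal sketch, assuming that 5-iodo-2′-deoxyuridine is the limiting reagent and using the calculated mass of (1) (377.08) as an approximation of its molar mass; the function name `percent_yield` is illustrative.

```python
def percent_yield(product_mg: float, product_mw: float, limiting_mmol: float) -> float:
    """Percent yield from product mass (mg), molar mass (g/mol) and limiting reagent (mmol)."""
    product_mmol = product_mg / product_mw  # mg divided by g/mol gives mmol
    return 100.0 * product_mmol / limiting_mmol


# Values for compound (1): 794 mg isolated, 2.96 mmol of starting iodide.
yield_1 = percent_yield(794, 377.08, 2.96)
print(f"{yield_1:.1f}%")
```

Under these assumptions the result rounds to the 71% stated above.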

5′-O-(tert-butyldimethylsilyl)-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyuridine (2)

To a solution of (1) (656 mg, 1.74 mmol) in dry DMF (15 ml) was added t-butyldimethylsilylchloride (288 mg, 1.91 mmol) in small portions, followed by imidazole (130 mg, 1.91 mmol). The reaction was followed by TLC and was completed after stirring for 8 h at room temperature. The reaction was quenched with sat. aq. NaCl solution. EtOAc (25 ml) was added to the reaction mixture and the aqueous layer was extracted with EtOAc three times. After drying the combined organics (MgSO₄), the solvent was removed under vacuum. Purification by chromatography on silica (EtOAc:petroleum ether 8:2) gave (2) as slightly yellow crystals (676 mg, 83%). ¹H NMR (d₆ DMSO) δ 0.00 (s, 6H, CH₃), 0.79 (s, 9H, tBu), 1.93-2.00 (m, 1H, H−2′), 2.06-2.11 (m, 1H, H−2′), 3.63-3.75 (m, 2H, H−5′), 3.79-3.80 (m, 1H, H−4′), 4.12-4.14 (m, 3H, H−3′, CH₂N), 5.22 (d, J=4.1 Hz, 1H, OH), 6.03 (t, J=6.9 Hz, 1H, H−1′), 7.86 (s, 1H, H−6), 9.95 (t, J=5.4 Hz, 1H, NH), 11.61 (br s, 1H, NH). Mass (−ve electrospray) calcd for C₂₀H₂₈F₃N₃O₆Si 491.17, found 490.
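The calculated masses quoted with the mass spectra are monoisotopic masses, and the values found in negative electrospray mode correspond to the deprotonated ion. This can be illustrated with a short sketch; the parser below is a simplified, hypothetical helper that accepts plain-text formulae without brackets (for example `C20H28F3N3O6Si`), and the isotope masses are standard values rounded to six decimal places.

```python
import re

# Monoisotopic masses (u) of the most abundant isotopes of the elements used here.
MONOISOTOPIC = {
    "C": 12.000000, "H": 1.007825, "N": 14.003074, "O": 15.994915,
    "F": 18.998403, "Si": 27.976927, "S": 31.972071, "P": 30.973762,
}
PROTON = 1.007276  # mass of a proton (u)


def monoisotopic_mass(formula: str) -> float:
    """Sum isotope masses for a simple formula such as 'C20H28F3N3O6Si'."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += MONOISOTOPIC[element] * (int(count) if count else 1)
    return total


def deprotonated_mz(formula: str) -> float:
    """m/z of the singly charged [M-H]- ion seen in negative electrospray."""
    return monoisotopic_mass(formula) - PROTON


mass_2 = monoisotopic_mass("C20H28F3N3O6Si")
mz_2 = deprotonated_mz("C20H28F3N3O6Si")
print(f"M = {mass_2:.2f}, [M-H]- = {mz_2:.2f}")
```

For compound (2) this gives a mass agreeing with the calculated 491.17 and an [M−H]⁻ value consistent with the nominal 490 found.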

5′-O-(tert-Butyldimethylsilyl)-3′-O-methylthiomethyl-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyuridine (3)

To a solution of (2) (1.84 g, 3.7 mmol) in dry DMSO (7 ml) was added acetic acid (3.2 ml) and acetic anhydride (10.2 ml). The mixture was stirred for 2 days at room temperature, before it was quenched with sat. aq. NaHCO₃. EtOAc (50 ml) was added and the aqueous layer was extracted three times with ethyl acetate. The combined organic layers were washed with sat. aq. NaHCO₃ solution and dried (MgSO₄). After removing the solvent under reduced pressure, the product (3) was purified by chromatography on silica (EtOAc:petroleum ether 8:2) yielding a clear sticky oil (1.83 g, 89%). ¹H NMR (d₆ DMSO): δ 0.00 (s, 6H, CH₃), 0.79 (s, 9H, tBu), 1.96-2.06 (m, 1H, H−2′), 1.99 (s, 3H, SCH₃), 2.20-2.26 (m, 1H, H−2′-), 3.63-3.74 (m, 2H, H−5′), 3.92-3.95 (m, 1H, H−4′), 4.11-4.13 (m, 2H, CH₂), 4.28-4.30 (m, 1H, H−3′), 4.59 (br s, 2H, CH₂), 5.97 (t, J=6.9 Hz, 1H, H−1′), 7.85 (s, 1H, H−6), 9.95 (t, J=5.3 Hz, 1H, NH), 11.64 (s, 1H, NH). Mass (−ve electrospray) calcd for C₂₂H₃₂F₃N₃O₆SSi 551.17, found 550.

3′-O-Azidomethyl-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyuridine (4)

To a solution of (3) (348 mg, 0.63 mmol) and cyclohexene (0.32 ml, 3.2 mmol) in dry CH₂Cl₂ (5 ml) at 4° C., sulfuryl chloride (1M in CH₂Cl₂, 0.76 ml, 0.76 mmol) was added drop wise under N₂. After 10 min TLC indicated the full consumption of the nucleoside (3). The solvent was evaporated and the residue was subjected to high vacuum for 20 min. It was then redissolved in dry DMF (3 ml) and treated with NaN₃ (205 mg, 3.15 mmol). The resulting suspension was stirred at room temperature for 2 h. The reaction was quenched with CH₂Cl₂ and the organic layers were washed with sat. aq. NaCl solution. After removing the solvent, the resulting yellow gum was redissolved in THF (2 ml) and treated with TBAF (1 M in THF, 0.5 ml) at room temperature for 30 min. The solvent was removed and the reaction worked up with CH₂Cl₂ and sat. aq. NaHCO₃ solution. The aqueous layer was extracted three times with CH₂Cl₂. Purification by chromatography on silica (EtOAc:petroleum ether 1:1 to EtOAc) gave (4) (100 mg, 37%) as a pale yellow foam. ¹H NMR (d₆DMSO) δ 2.15-2.26 (m, 2H, H−2′), 3.47-3.57 (m, 2H, H−5′), 3.88-3.90 (m, 1H, H−4′), 4.14 (d, J=4.7 Hz, 2H, CH₂NH), 4.24-4.27 (m, 1H, H−3′), 4.75 (s, 2H, CH₂N₃), 5.14 (t, J=5.2 Hz, 1H, OH), 5.96-6.00 (m, 1H, H−1′), 8.10 (s, 1H, H−6), 10.00 (s, 1H, NHCOCF₃), 11.26 (s, 1H, NH).

Preparation of bis(tri-n-butylammonium)pyrophosphate (0.5 M Solution in DMF)

Tetrasodium diphosphate decahydrate (1.5 g, 3.4 mmol) was dissolved in water (34 ml) and the solution was applied to a column of dowex in the H⁺ form. The column was eluted with water. The eluent dropped directly into a cooled (ice bath) and stirred solution of tri-n-butylamine (1.6 ml, 6.8 mmol) in EtOH (14 ml). The column was washed until the pH of the eluent increased to 6. The aq. ethanol solution was evaporated to dryness and then co-evaporated twice with ethanol and twice with anhydrous DMF. The residue was dissolved in DMF (6.7 ml). The pale yellow solution was stored over 4 Å molecular sieves.
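The nominal 0.5 M concentration and the bis(tri-n-butylammonium) stoichiometry follow from the quantities above, as the short calculation below illustrates. It assumes quantitative recovery of the pyrophosphate from the column and neglects any volume change on dissolution; the function name `molarity` is illustrative.

```python
def molarity(amount_mmol: float, volume_ml: float) -> float:
    """Concentration in mol/l; mmol per ml is numerically equal to mol per l."""
    return amount_mmol / volume_ml


pyrophosphate_mmol = 3.4   # tetrasodium diphosphate decahydrate applied to the column
tributylamine_mmol = 6.8   # tri-n-butylamine in the receiving solution
final_volume_ml = 6.7      # DMF used to dissolve the residue

concentration = molarity(pyrophosphate_mmol, final_volume_ml)
amine_equivalents = tributylamine_mmol / pyrophosphate_mmol

print(f"{concentration:.2f} M, {amine_equivalents:.1f} equiv. of amine")
```

The result is approximately 0.51 M with two equivalents of amine per pyrophosphate.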

3′-O-Azidomethyl-5-(3-amino-prop-1-ynyl)-2′-deoxyuridine 5′-O-nucleoside triphosphate (5)

The nucleoside (4) and proton sponge were dried over P₂O₅ under vacuum overnight. A solution of (4) (92 mg, 0.21 mmol) and proton sponge (90 mg, 0.42 mmol) in trimethylphosphate (0.5 ml) was stirred with 4 Å molecular sieves for 1 h. Freshly distilled POCl₃ (24 μl, 0.26 mmol) was added and the solution was stirred at 4° C. for 2 h. The mixture was slowly warmed up to room temperature and bis (tri-n-butyl ammonium) pyrophosphate (1.7 ml, 0.85 mmol) and anhydrous tri-n-butyl amine (0.4 ml, 1.7 mmol) were added. After 3 min, the reaction was quenched with 0.1 M TEAB (triethylammonium bicarbonate) buffer (15 ml) and stirred for 3 h. The water was removed under reduced pressure and the resulting residue dissolved in concentrated ammonia (ρ0.88, 15 ml) and stirred at room temperature for 16 h. The reaction mixture was then evaporated to dryness. The residue was dissolved in water and the solution applied to a DEAE-Sephadex A-25 column. MPLC was performed with a linear gradient of TEAB. The triphosphate was eluted between 0.7 M and 0.8 M buffer. Fractions containing the product were combined and evaporated to dryness. The residue was dissolved in water and further purified by HPLC. HPLC: t_(r)(5): 18.8 min (Zorbax C18 preparative column, gradient: 5% to 35% B in 30 min, buffer A 0.1M TEAB, buffer B MeCN). The product was isolated as a white foam (76 O.D., 7.6 μmol, 3.8%, ε₂₈₀=10000). ¹H NMR (D₂O) δ 1.79 (s, CH₂), 2.23-2.30; 2.44-2.50 (2×m, 2H, H−2′), 3.85 (m, CH₂NH), 4.10-4.18 (m, 2H, H−5′), 4.27 (br s, H−4′), 4.48-4.50 (m, H−3′), 4.70-4.77 (m, CH₂N₃), 6.21 (t, J=6.6 Hz, H−1′), 8.32 (s, 1H, H−6). ³¹P NMR (D₂O) δ −6.6 (m, 1P, P_(γ)), −10.3 (d, J=18.4 Hz, 1P, P_(α)), −21.1 (m, 1P, P_(β)). Mass (−ve electrospray) calcd for C₁₃H₁₉N₆O₁₄P₃ 576.02, found 575.
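The amount of triphosphate is quoted in optical density units and converted to micromoles using the extinction coefficient. A minimal sketch of this conversion, based on the Beer-Lambert law, is given below. It assumes that one O.D. unit is an absorbance of 1 measured in a 1 cm path length for 1 ml of solution and that ε is expressed in M⁻¹ cm⁻¹; the function name is illustrative.

```python
def micromoles_from_od(od_units: float, epsilon: float) -> float:
    """Convert O.D. units (absorbance x ml, 1 cm path) to micromoles.

    Beer-Lambert: A = epsilon * c * l, so c = A / epsilon in mol/l.
    Multiplying by the volume in ml gives mmol; times 1000 gives micromoles.
    """
    return od_units / epsilon * 1000.0


amount_5 = micromoles_from_od(76, 10000)
print(f"{amount_5:.1f} micromol")
```

With 76 O.D. and ε₂₈₀=10000 this reproduces the 7.6 μmol stated for (5).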

Cy-3 disulfide Linker

The starting disulfide (4.0 mg, 13.1 μmol) was dissolved in DMF (300 μL) and diisopropylethylamine (4 μL) was slowly added. The mixture was stirred at room temperature and a solution of Cy-3 dye (5 mg, 6.53 μmol) in DMF (300 μL) was added over 10 min. After 3.5 h, on complete reaction, the volatiles were evaporated under reduced pressure and the crude residue was HPLC purified on a Zorbax analytical column SB-C18 with a flow rate of 1 ml/min in 0.1M triethylammonium bicarbonate buffer (buffer A) and CH₃CN (buffer B) using the following gradient: 0-5 min 2% B; 5-31 min 55% B; 31-33 min 95% B; 33-37 min 95% B; 37-39 min 2% B; 39-44 min 2% B. The expected Cy3-disulfide linker was eluted with a t_(r): 21.8 min. in 70% yield (based on a UV measurement; ε₅₅₀ 150,000 cm⁻¹ M⁻¹ in H₂O) as a hygroscopic solid. ¹H NMR (D₂O) δ 1.31-1.20 (m+t, J=7.2 Hz, 5H, CH₂+CH₃), 1.56-1.47 (m, 2H, CH₂), 1.67 (s, 12H, 4 CH₃), 1.79-1.74 (m, 2H, CH₂), 2.11 (t, J=6.9 Hz, 2H, CH₂), 2.37 (t, J=6.9 Hz, 2H, CH₂), 2.60 (t, J=6.3 Hz, 2H, CH₂), 2.67 (t, J=6.9 Hz, 2H, CH₂), 3.27 (t, J=6.1 Hz, 2H, CH₂), 4.10-4.00 (m, 4H, 2CH₂), 6.29 (dd, J=13.1, 8.1 Hz, 2H, 2 ═CH), 7.29 (dd, 2H, J=8.4, 6.1 Hz, 2 ═CH), 7.75-7.71 (m, 2H, 2 ═CH), 7.78 (s, 2H, ═CH), 8.42 (t, J=12.8 Hz, 1H, ═CH). Mass (−ve electrospray) calcd for C₃₆H₄₇N₃O₉S₄ 793.22, found 792 (M−H), 396 [M/2].

A mixture of Cy3 disulphide linker (2.5 μmol), disuccinimidyl carbonate (0.96 mg, 3.75 μmol) and DMAP (0.46 mg, 3.75 μmol) was dissolved in dry DMF (0.5 ml) and stirred at room temperature for 10 min. The reaction was monitored by TLC (MeOH:CH₂Cl₂ 3:7) until all the dye linker was consumed. Then a solution of (5) (7.5 μmol) and n-Bu₃N (30 μl, 125 μmol) in DMF (0.2 ml) was added to the reaction mixture and stirred at room temperature for 1 h. TLC (MeOH:CH₂Cl₂ 4:6) showed complete consumption of the activated ester and a dark red spot appeared on the baseline. The reaction was quenched with TEAB buffer (0.1M, 10 ml) and loaded on a DEAE Sephadex column (2×5 cm). The column was first eluted with 0.1 M TEAB buffer (100 ml) to wash off organic residues and then 1 M TEAB buffer (100 ml). The desired triphosphate-analogue (6) was eluted out with 1 M TEAB buffer. The fractions containing the product were combined, evaporated and purified by HPLC. HPLC conditions: t_(r)(6): 16.1 min (Zorbax C18 preparative column, gradient: 2% to 55% B in 30 min, buffer A 0.1M TEAB, buffer B MeCN). The product was isolated as a dark red solid (1.35 μmol, 54%, ε₅₅₀=150000). ¹H NMR (D₂O) δ 1.17-1.28 (m, 6H 3×CH₂), 1.41-1.48 (m, 3 H, CH₃), 1.64 (s, 12H, 4×CH₃), 1.68-1.71 (m, 2H, CH₂), 2.07-2.10 (m, 3H, H−2′, CH₂), 2.31-2.35 (m, 1H, H−2′), 2.50-2.54 (m, 2H, CH₂), 2.65 (t, J=5.9 Hz, 2H, CH₂), 2.76 (t, J=7.0 Hz, 2H, CH₂), 3.26-3.31 (m, 2H, CH₂), 3.88-3.91 (m, 2H CH₂), 3.94-4.06 (m, 3H, CH₂N, H−5′), 4.16 (br s, 1H, H−4′), 4.42-4.43 (m, 1H, H−3′), 4.72-4.78 (m, 2H, CH₂N₃), 6.24 (dd, J=5.8, 8.2 Hz, H−1′), 6.25 (dd, J=3.5, 8.5 Hz, 2H, H_(Ar)), 7.24, 7.25 (2d, J=14.8 Hz, 2×═CH), 7.69-7.86 (m, 4H, H_(Ar), H−6), 8.42 (t, J=13.4 Hz, ═CH). ³¹P NMR (D₂O) δ −4.85 (m, 1P, P_(γ)), −9.86 (m, 1P, P_(α)), −20.40 (m, 1P, P_(β)). Mass (−ve electrospray) calcd for C₄₉H₆₄N₉O₂₂P₃S₄ 1351.23, found 1372 (M−2H+Na), 1270 [M−80], 1190 [M−160].
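The stoichiometry of this coupling can be summarised as equivalents relative to the dye linker, which is taken here to be the limiting reagent (an assumption consistent with the stated 54% yield). The helper `equivalents` and the dictionary keys are illustrative names only.

```python
def equivalents(amounts_umol: dict, limiting: str) -> dict:
    """Express each reagent amount relative to the limiting reagent."""
    reference = amounts_umol[limiting]
    return {name: amount / reference for name, amount in amounts_umol.items()}


# Amounts in micromoles, as given in the procedure for (6).
amounts = {
    "Cy3 disulphide linker": 2.5,
    "disuccinimidyl carbonate": 3.75,
    "DMAP": 3.75,
    "triphosphate (5)": 7.5,
}

eq = equivalents(amounts, "Cy3 disulphide linker")
coupling_yield = 100.0 * 1.35 / amounts["Cy3 disulphide linker"]

for name, value in eq.items():
    print(f"{name}: {value:.1f} equiv.")
print(f"yield: {coupling_yield:.0f}%")
```

On this basis the activating reagents are used at 1.5 equivalents and the triphosphate at 3 equivalents, and 1.35 μmol of product corresponds to the 54% quoted.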

5-[3-(2,2,2-Trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (7)

To a solution of 5-iodo-2′-deoxycytidine (10 g, 28.32 mmol) in DMF (200 ml) in a light protected round bottom flask under Argon atmosphere, was added CuI (1.08 g, 5.67 mmol), triethylamine (7.80 ml, 55.60 mmol), 2,2,2-trifluoro-N-prop-2-ynyl-acetamide (12.8 g, 84.76 mmol) and finally Pd(PPh₃)₄ (3.27 g, 2.83 mmol). After 18 hours at room temperature, dowex bicarbonate (20 mg) was added and the mixture was stirred for a further 1 h. Filtration and evaporation of the volatiles under reduced pressure gave a residue that was purified by flash chromatography on silica gel (CH₂Cl₂, CH₂Cl₂:EtOAc 1:1, EtOAc:MeOH 9:1). The expected product (7) was obtained as a beige solid in quantitative yield. ¹H NMR (D₂O) δ 2.24-2.17 (m, 1H, H−2′), 2.41-2.37 (m, 1H, H−2′), 3.68 (dd, J=12.5, 5.0 Hz, 1H, H−5′), 3.77 (dd, J=12.5, 3.2 Hz, 1H, H−5′), 3.99 (m, 1H, H−4′), 4.27 (s, 2H, CH₂N), 4.34 (m, 1H, H−3′), 6.11 (t, J=6.3 Hz, 1H, H−1′), 8.1 (br s, 1H, NH); MS (ES): m/z (%) (M−H) 375 (100).

5′-O-(tert-Butyldimethylsilyl)-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (8)

To a solution of the starting material (7) (1.0 g, 2.66 mmol) and imidazole (200 mg, 2.93 mmol) in DMF (3.0 ml) at 0° C., was slowly added TBDMSCl (442 mg, 2.93 mmol) in four portions over 1 h. After 2 h, the volatiles were evaporated under reduced pressure and the residue was adsorbed on silica gel and purified by flash chromatography (EtOAc, EtOAc:MeOH 9.5:0.5). The expected product (8) was isolated as a crystalline solid (826 mg, 64%). ¹H NMR (d₆ DMSO) δ 0.00 (s, 1H, CH₃); 0.01 (s, 1H, CH₃), 0.79 (s, 9 H, tBu), 1.87-1.80 (m, 1H, H−2′), 2.12 (ddd, J=13.0, 5.8 and 3.0 Hz, 1H, H−2′), 3.65 (dd, J=11.5, 2.9 Hz, 1H, H−5′), 3.74 (dd, J=11.5, 2.5 Hz, 1H, H−5′), 3.81-3.80 (m, 1H, H−4′), 4.10-4.09 (m, 1H, H−3′), 4.17 (d, 2H, J=5.1 Hz, NCH₂), 5.19 (d, 1H, J=4.0 Hz, 3′ —OH), 6.04 (t, J=6.6 Hz, 1H, H−1′), 6.83 (br s, 1H, NHH), 7.78 (br s, 1H, NHH), 7.90 (s, 1H, H−6), 9.86 (t, J=5.1 Hz, 1H, —H₂CNH); MS (ES): m/z (%) (MH)⁺ 491 (40%).

4-N-Acetyl-5′-O-(tert-butyldimethylsilyl)-3′-O-(methylthiomethyl)-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (9)

To a solution of the starting material (8) (825 mg, 1.68 mmol) in DMSO (6.3 ml) and N₂ atmosphere, was slowly added acetic acid (AcOH) (1.3 ml, 23.60 mmol) followed by acetic anhydride (Ac₂O) (4.8 ml, 50.50 mmol). The solution was stirred at room temperature for 18 h and quenched at 0° C. by addition of saturated NaHCO₃ (20 ml). The product was extracted into EtOAc (3×30 ml), organic extracts combined, dried (MgSO₄), filtered and the volatiles evaporated. The crude residue was purified by flash chromatography on silica gel (EtOAc:petroleum ether 1:1) to give the expected product as a colourless oil (9) (573 mg, 62%). ¹H NMR (d₆ DMSO) δ 0.00 (s, 6H, 2×CH₃), 0.78 (s, 9H, tBu), 2.01 (s, 3H, SCH₃), 2.19-1.97 (m, 2H, 2×H2′), 2.25 (s, 3H, COCH₃), 3.67 (dd, 1H, J=11.5 Hz, H−5′), 3.78 (dd, 1H, J=11.5, 3.3 Hz, H−5′), 4.06-4.05 (m, 1H, H−4′), 4.17 (d, 2H, J=5.1 Hz, N—CH₂), 4.30-4.28 (m, 1H, H−3′), 4.63 (s, 2H, CH₂—S), 5.94 (t, 1H, J=6.5 Hz, H−1′), 8.17 (s, 1H, H−6), 9.32 (s, 1H, NHCO), 9.91 (t, 1H, J=5.4 Hz, NHCH₂); MS (ES): m/z (%) (MH)⁺ 593.

4-N-Acetyl-3′-O-(azidomethyl)-5′-O-(tert-butyldimethylsilyl)-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (10)

To a solution of the starting material (9) (470 mg, 0.85 mmol) in dichloromethane (DCM) (8 ml) under N₂ atmosphere and cooled to 0° C., was added cyclohexene (430 μl, 4.27 mmol) followed by SO₂Cl₂ (1 M in DCM, 1.0 ml, 1.02 mmol). The solution was stirred for 30 minutes at 0° C., and the volatiles were evaporated. The residue was immediately dissolved in DMF (8 ml) and stirred under N₂, and sodium azide (275 mg, 4.27 mmol) was slowly added. After 18 h, the crude product was evaporated to dryness, dissolved in EtOAc (30 ml) and washed with Na₂CO₃ (3×5 ml). The combined organic layer was kept separately. A second extraction of the product from the aqueous layer was performed with DCM (3×10 ml). All the combined organic layers were dried (MgSO₄), filtered and the volatiles evaporated under reduced pressure to give an oil identified as the expected product (10) (471 mg, 94% yield). This was used without any further purification. ¹H NMR (d₆ DMSO) δ 0.11 (s, 3H, CH₃), 0.11 (s, 3H, CH₃), 0.88 (s, 9H, ^(t)Bu), 2.16-2.25 (m, 1H, H−2′), 2.35 (s, 3H, COCH₃), 2.47-2.58 (m, 1H, H−2′), 3.79 (dd, J=11.6, 3.2 Hz, 1H, H−5′), 3.90 (dd, J=11.6, 3.0 Hz, 1H, H−5′), 4.17-4.19 (m, 1H, H−4′), 4.28 (s, 2H, NCH₂), 4.32-4.35 (m, 1H, H−3′), 4.89 (dd, J=14.4, 6.0 Hz, 2H, CH₂—N₃), 6.05 (t, J=6.4 Hz, 1H, H−1′), 8.25 (s, 1H, H−6), 9.46 (br s, 1H, NHH), 10.01 (br s, 1H, NHH).

4-N-Acetyl-3′-O-(azidomethyl)-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (11) and 3′-O-(Azidomethyl)-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (12)

To a solution of the starting material (10) (440 mg, 0.75 mmol) in THF (20 ml) at 0° C. and N₂ atmosphere, was added TBAF in THF 1.0 M (0.82 ml, 0.82 mmol). After 1.5 h, the volatiles were evaporated under reduced pressure and the residue purified by flash chromatography on silica gel (EtOAc:petroleum ether 8:2 to EtOAc 100% to EtOAc:MeOH 8:2). Two compounds were isolated and identified as described above. The first to elute was the 4-N-acetyl compound (11) (53 mg, 15%) and the second was the 4-NH₂ compound (12) (271 mg, 84%).

Compound 4-N-Acetyl (11): ¹H NMR (d₆ DMSO) δ 1.98 (s, 3H, CH₃CO), 2.14-2.20 (m, 2H, HH-2′), 3.48-3.55 (m, 1H, H−5′), 3.57-3.63 (m, 1H, H−5′), 3.96-4.00 (m, 1H, H−4′), 4.19 (d, J=5.3 Hz, 2H, CH₂—NH), 4.23-4.28 (m, 1H, H−3′), 4.77 (s, 2H, CH₂—N₃), 5.2 (t,1H, J=5.1 Hz, 5′-OH), 5.95 (t, J=6.2 Hz, 1H, H−1′), 8.43 (s, 1H, H−6), 9.34 (s, 1H, CONH), 9.95 (t, J=5.3 Hz, 1H, NHCH₂).

Compound 4-NH₂ (12): ¹H NMR (d₆ DMSO) δ 1.98-2.07(2H, CHH-2′), 3.50-3.63 (m, 2H, CHH-5′), 3.96-4.00 (m, 1H, H−4′), 4.09 (d, J=5.3 Hz, 2H, CH₂—NH), 4.24-4.28 (m, 1H, H−3′), 4.76 (s, 2H, CH₂—N₃), 5.13 (t, J=5.3 Hz, 1H, 5′-OH), 5.91 (br s, 1H, NHH), 6.11 (t, J=6.4 Hz, 1H, H−1′), 8.20 (t, J=5.3 Hz, 1H, NCH₂), 8.45 (s, 1H, H−6), 11.04 (br s, 1H, NHH).

4-N-Benzoyl-5′-O-(tert-butyldimethylsilyl)-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (13)

The starting material (8) (10 g, 20.43 mmol) was azeotroped in dry pyridine (2×100 ml) then dissolved in dry pyridine (160 ml) under N₂ atmosphere. Chlorotrimethylsilane (10 ml, 79.07 mmol) was added drop wise to the solution, which was stirred for 2 hours at room temperature. Benzoyl chloride (2.6 ml, 22.40 mmol) was then added and the solution stirred for one further hour. The reaction mixture was cooled to 0° C., distilled water (50 ml) was added slowly and the solution stirred for 30 minutes. Pyridine and water were evaporated from the mixture under high vacuum to yield a brown gel that was partitioned between sat. aq. NaHCO₃ solution (100 ml) and DCM. The organic phase was separated and the aqueous phase extracted with a further (2×100 ml) of DCM. The organic layers were combined, dried (MgSO₄), filtered and the volatiles evaporated under reduced pressure. The resulting brown oil was purified by flash chromatography on silica gel (DCM:MeOH 99:1 to 95:5) to yield a light yellow crystalline solid (13) (8.92 g, 74%). ¹H NMR (d₆ DMSO): δ 0.00 (s, 6H, CH₃), 0.78 (s, 9H, tBu), 1.94 (m, 1H, H−2′), 2.27 (m, 1H, H−2′), 3.64 (d, 1H, J=11.6 Hz, H−5′), 3.75 (d, 1H, J=11.6 Hz, H−5′), 3.91 (m, 1H, H−4′), 4.09 (br m, 3H, CH₂NH, H−3′), 5.24 (s, 1H, 3′-OH), 6.00 (m, 1H, H−1′), 7.39 (m, 2H, Ph), 7.52 (m, 2H, Ph), 7.86 (m, 1H, Ph), 8.0 (s, 1H, H−6), 9.79 (t, 1H, J=5.4 Hz, NHCH₂), 12.67 (br s, 1H, NH). Mass (+ve electrospray) calcd for C₂₇H₃₃F₃N₄O₆Si 594.67, found 595.

4-N-Benzoyl-5′-O-(tert-butyldimethylsilyl)-3′-O-methylthiomethyl-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (14)

The starting material (13) (2.85 g, 4.79 mmol) was dissolved in dry DMSO (40 ml) under N₂ atmosphere. Acetic acid (2.7 ml, 47.9 mmol) and acetic anhydride (14.4 ml, 143.7 mmol) were added sequentially and slowly to the starting material, which was then stirred for 18 h at room temperature. Saturated NaHCO₃ (150 ml) solution was carefully added to the reaction mixture. The aqueous layer was extracted with EtOAc (3×150 ml). The organic layers were combined, dried (MgSO₄), filtered and evaporated to yield an orange liquid that was subsequently azeotroped with toluene (4×150 ml) until material solidified. Crude residue purified on silica gel (petroleum ether:EtOAc 3:1 to 2:1) to yield a yellow crystalline solid (14) (1.58 g, 50%). ¹H NMR (d₆ DMSO): δ 0.00 (s, 6H, CH₃), 0.78 (s, 9H, tBu), 1.99 (s, 3H, CH₃), 2.09 (m, 1H, H−2′), 2.28 (m, 1H, H−2′), 3.66 (d, 1H, J=11.5, 2.9 Hz, H−5′), 3.74 (dd, 1H, J=11.3, 2.9 Hz, H−5′), 3.99 (m, 1H, H−4′), 4.09 (m, 1H, CH₂NH), 4.29 (m, 1H, H−3′), 4.61 (s, 2H, CH₂S), 6.00 (m, 1H, H−1′), 7.37 (m, 2H, Ph), 7.50 (m, 2H, Ph), 7.80 (d, 1H, J=7.55 Hz, H_(Ar)), 7.97 (s, 1H, H−6), 9.79 (br t, 1H, NHCH₂), 12.64 (br s, 1H, NH). Mass (−ve electrospray) calcd for C₂₉H₃₇F₃N₄O₆SSi 654.79, found 653.2.

4-N-Benzoyl-5′-O-(tert-butyldimethylsilyl)-3′-O-azidomethyl-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (15)

The starting material (14) (1.65 g, 2.99 mmol) was dissolved in DCM (18 ml) and cooled to 0° C. Cyclohexene (1.5 ml, 14.95 mmol) and SO₂Cl₂ (0.72 ml, 8.97 mmol) were added and the mixture was stirred for 1 h in an ice bath. TLC indicated starting material still to be present whereupon a further aliquot of SO₂Cl₂ (0.24 ml) was added and the mixture stirred for 1 h at 0° C. Volatiles were removed by evaporation to yield a light brown solid that was redissolved in dry DMF (18 ml) under N₂. Sodium azide (0.97 g, 14.95 mmol) was then added to the solution and stirred for 2.5 h at room temperature. The reaction mixture was passed through a pad of silica and eluted with EtOAc and the volatiles removed by high vacuum evaporation. The resulting brown gel was purified by flash chromatography (petroleum ether:EtOAc 4:1 to 2:1) to yield the desired product as a white crystalline solid (15) (0.9 g, 55%). ¹H NMR (d₆ DMSO): δ 0.00 (s, 6H, CH₃), 0.78 (s, 9H, tBu), 2.16 (m, 1H, H−2′), 2.22 (m, 1H, H−2′), 3.70 (d, 1H, J=11.5 Hz, H−5′), 3.75 (d, 1H, J=11.3 Hz, H−5′), 4.01 (m, 1H, H−4′), 4.10 (m, 1H, CH₂NH), 4.23 (m, 1H, H−3′), 4.76 (s, 2H, CH₂S), 5.99 (m, 1H, H−1′), 7.37 (m, 2H, Ph), 7.50 (m, 2H, Ph), 7.81 (d, 1H, J=7.4 Hz, Ph), 7.95 (s, 1H, H−6), 9.78 (br s, 1H, NHCH₂), 12.64 (br s, 1H, NH). Mass (−ve electrospray) calcd. for C₂₈H₃₄F₃N₇O₆Si 649.71, found 648.2.

4-N-Benzoyl-3′-O-azidomethyl-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (16)

The starting material (15) (140 mg, 0.22 mmol) was dissolved in THF (7.5 ml). TBAF (1M soln. in THF, 0.25 ml) was added slowly and stirred for 2 h at room temperature. Volatile material removed under reduced pressure to yield a brown gel that was purified by flash chromatography (EtOAc:DCM 7:3) to yield the desired product (16) as a light coloured crystalline solid (0.9 g, 76%). ¹H NMR (d₆ DMSO): δ 2.16 (m, 1H, H−2′), 2.22 (m, 1H, H−2′), 3.70 (d, 1H, J=11.5 Hz, H−5′), 3.75 (d, 1H, J=11.3 Hz, H−5′), 4.01 (m, 1H, H−4′), 4.10 (m, 1H, CH₂NH), 4.23 (m, 1H, H−3′), 4.76 (s, 2H, CH₂S), 5.32 (s, 1H, 5′ OH), 5.99 (m, 1H, H−1′), 7.37 (m, 2H, Ph), 7.50 (m, 2H, Ph), 7.81 (d, 1H, J=7.35 Hz, Ph), 7.95 (s, 1H, H−6), 9.78 (br s, 1H, NHCH₂), 12.64 (br s, 1H, NH). Mass (−ve electrospray) calcd for C₂₂H₂₀F₃N₇O₆ 535.44, found 534.

5-(3-Amino-prop-1-ynyl)-3′-O-azidomethyl-2′-deoxycytidine 5′-O-nucleoside triphosphate (17)

To a solution of (11) and (12) (290 mg, 0.67 mmol) and proton sponge (175 mg, 0.82 mmol) (both previously dried under P₂O₅ for at least 24 h) in PO(OMe)₃ (600 μl), at 0° C. under Argon atmosphere, was slowly added POCl₃ (freshly distilled) (82 μl, 0.88 mmol). The solution was vigorously stirred for 3 h at 0° C. and then quenched by addition of tetra-tributylammonium diphosphate (0.5 M) in DMF (5.2 ml, 2.60 mmol), followed by nBu₃N (1.23 ml, 5.20 mmol) and triethylammonium bicarbonate (TEAB) 0.1 M (20 ml). After 1 h at room temperature aqueous ammonia solution (ρ0.88, 20 ml) was added to the mixture. The solution was stirred at room temperature for 15 h, the volatiles were evaporated under reduced pressure and the residue was purified by MPLC with a gradient of TEAB from 0.05M to 0.7M. The expected triphosphate was eluted from the column at approx. 0.60 M TEAB. A second purification was done by HPLC in a Zorbax SB-C18 column (21.2 mm i.d.×25 cm) eluted with 0.1M TEAB (pump A) and 30% CH₃CN in 0.1M TEAB (pump B) using a gradient as follows: 0-5 min 5% B, Φ0.2 ml; 5-25 min 80% B, Φ0.8 ml; 25-27 min 95% B, Φ0.8 ml; 27-30 min 95% B, Φ0.8 ml; 30-32 min 5% B, Φ0.8 ml; 32-35 min 95% B, Φ0.2 ml, affording the product described above with a t_(r)(17): 20.8 (14.5 μmols, 2.5% yield); ³¹P NMR (D₂O, 162 MHz) δ −5.59 (d, J=20.1 Hz, P_(γ)), −10.25 (d, J=19.3 Hz, 1P, P_(α)), −20.96 (t, J=19.5 Hz, 1P, P_(β)); ¹H NMR (D₂O) δ 2.47-2.54 (m, 1H, H−2′), 2.20-2.27 (m, 1H, H−2′), 3.88 (s, 2H, CH₂N), 4.04-4.12 (m, 1H, HH-5′), 4.16-4.22 (m, 1H, HH-5′), 4.24-4.30 (m, 1H, H−4′), 4.44-4.48 (m, 1H, H−3′), 6.13 (t, J=6.3 Hz, 1H, H−1′), 8.35 (s, 1H, H−6); MS (ES): m/z (%) (M−H) 574 (73%), 494 (100%).

Alexa488 Disulfide Linker

Commercially available Alexa Fluor 488-NHS (35 mg, 54 μmol) was dissolved in DMF (700 μL) and, to ensure full activation, 4-DMAP (7 mg, 59 μmol) and N,N′-disuccinimidyl carbonate (15 mg, 59 μmol) were sequentially added. After 15 min, on complete activation, a solution of the starting disulfide (32.0 mg, 108 μmol) in DMF (300 μL) containing diisopropylethylamine (4 μL) was added to the solution of the activated dye. Further diisopropylethylamine (20 μL) was added to the final mixture, which was ultrasonicated for 5 min and reacted for 18 h at room temperature in the dark. The volatiles were evaporated under reduced pressure and the crude residue was first purified by passing it through a short ion exchange resin Sephadex-DEAE A-25 (40-120μ) column, first eluted with TEAB 0.1 M (25 ml) then 1.0 M TEAB (75 ml). The latter fraction, containing the two final compounds, was concentrated and the residue was HPLC purified in a Zorbax SB-C18 column (21.2 mm i.d.×25 cm) eluted with 0.1M TEAB (pump A) and CH₃CN (pump B) using a gradient as follows: 0-2 min 2% B, Φ0.2 ml; 2-4 min 2% B, Φ0.8 ml; 4-15 min 23% B, Φ0.8 ml; 15-24 min 23% B, Φ0.8 ml; 24-26 min 95% B, Φ0.8 ml; 26-28 min 95% B, Φ0.8 ml, 28-30 min 2% B, Φ0.8 ml, 30-33 min 2% B, Φ0.2 ml affording both compounds detailed above with t_(r): 19.0 (left regioisomer) and t_(r): 19.5 (right regioisomer). Both regioisomers were respectively passed through a dowex ion exchange resin column, affording respectively 16.2 μmol and 10.0 μmol, 62% total yield (based on commercially available Alexa Fluor 488-NHS of 76% purity); ε₄₉₃=71,000 cm⁻¹ M⁻¹ in H₂O. ¹H NMR (D₂O) (left regioisomer) δ 2.51 (t, J=6.8 Hz, 2H, CH₂), 2.66 (t, J=6.8 Hz, 2H, CH₂), 2.71 (t, J=5.8 Hz, 2H, CH₂), 3.43 (t, J=5.8 Hz, 2H, CH₂), 6.64 (d, J=9.2 Hz, 2H, H_(Ar)), 6.77 (d, J=9.2 Hz, 2H, H_(Ar)), 7.46 (s, 1H, H_(Ar)), 7.90 (dd, J=8.1 and 1.5 Hz, 1H, H_(Ar)), 8.20 (d, J=8.1 Hz, 1H, H_(Ar)).
¹H NMR (D₂O) (right regioisomer) δ 2.67 (t, J=6.8 Hz, 2H, CH₂), 2.82 (t, J=6.8 Hz, 2H, CH₂), 2.93 (t, J=6.1 Hz, 2H, CH₂), 3.68 (t, J=6.1 Hz, 2H, CH₂), 6.72 (d, J=9.3 Hz, 2H, H_(Ar)), 6.90 (d, J=9.3 Hz, 2H, H_(Ar)), 7.32 (d, J=7.9 Hz, 1H, H_(Ar)), 8.03 (dd, J=7.9, 1.7 Hz, 1H, H_(Ar)), 8.50 (d, J=1.8 Hz, 1H, H_(Ar)) Mass (−ve electrospray) calcd for C₂₆H₂₃N₃O₁₂S₄ 697.02, found 692 (M−H), 347 [M/2].

To a solution of Alexa Fluor 488 disulfide linker (3.4 μmol, 2.37 mg) in DMF (200 μL) was added 4-DMAP (0.75 mg, 5.1 μmol) and N,N′-disuccinimidyl carbonate (1.70 mg, 5.1 μmol). The mixture was stirred for 15 min to full activation of the acid, then it was added into the solution of the nucleotide (17) (3.45 mg, 6.0 μmol) in DMF (0.3 ml) containing nBu₃N (40 μL) at 0° C. The mixture was sonicated for 3 min and then continuously stirred for 16 h in the absence of light. The volatiles were evaporated under reduced pressure and the residue was firstly purified by filtration through a short ion exchange resin Sephadex-DEAE A-25 column, first eluted with TEAB 0.1 M (50 ml) removing the unreacted dye-linker, then 1.0 M TEAB (100 ml) to collect the expected product (18). After concentration, the residue was HPLC purified in a Zorbax SB-C18 column (21.2 mm i.d.×25 cm) eluted with 0.1M TEAB (pump A) and CH₃CN (pump B) using a gradient as follows: 0-2 min 2% B, Φ0.2 ml; 2-4 min 2% B, Φ0.8 ml; 4-15 min 23% B, Φ0.8 ml; 15-24 min 23% B, Φ0.8 ml; 24-26 min 95% B, Φ0.8 ml; 26-28 min 95% B, Φ0.8 ml, 28-30 min 2% B, Φ0.8 ml, 30-33 min 2% B, Φ0.2 ml affording the product detailed above with a t_(r)(18): 19.8 (0.26 μmols, 12% yield based on UV measurement; λ_(max)=493 nm, ε 71,000 cm⁻¹ M⁻¹ in H₂O); ³¹P NMR (D₂O, 162 MHz) δ −5.06 (d, J=20.6 Hz, 1P, P_(γ)), −10.25 (d, J=19.3 Hz, 1P, P_(α)), −21.21 (t, J=19.5 Hz, 1P, P_(β)); ¹H NMR (D₂O) δ 2.09-2.17 (m, 1H, HH-2′), 2.43-2.50 (m, 1H, HH-2′), 2.61 (t, J=6.8 Hz, 2H, H₂C—S), 2.83 (2H, S—CH₂), 3.68 (t, J=6.0 Hz, 2H, ArCONCH₂), 4.06 (s, 2H, CH₂N), 4.08-4.17 (m, 4H, HH-5′), 4.25-4.29 (m, 1H, H−4′), 4.46-4.50 (m, 1H, H−3′), 6.09 (t, J=6.4 Hz, 1H, H−1′), 6.88 (d, J=9.1 Hz, 1H, H_(Ar)), 6.89 (d, J=9.3 Hz, 1H, H_(Ar)), 7.15 (d, J=9.3 Hz, 1H, H_(Ar)), 7.17 (d, J=9.1 Hz, 1H, H_(Ar)), 7.64 (br s, 1H, H_(Ar)), 8.00-7.94 (m, 2H, H_(Ar)), 8.04 (s, 1H, H−6); MS (ES): m/z (%) (M−H) 1253 (46%), (M−H+Na) 1275 (100%).

7-Deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyguanosine (19)

Under N₂, a suspension of 7-deaza-7-iodo-guanosine (2 g, 2.75 mmol), Pd(PPh₃)₄ (582 mg, 0.55 mmol), CuI (210 mg, 1.1 mmol), Et₃N (1.52 ml, 11 mmol) and the propargylamine (2.5 g, 16.5 mmol) in DMF (40 ml) was stirred at room temperature for 15 h. The reaction was protected from light with aluminium foil. After TLC indicated the full consumption of starting material, the reaction mixture was concentrated. The residue was diluted with MeOH (20 ml) and treated with dowex-HCO₃⁻. The mixture was stirred for 30 min and filtered. The solution was concentrated and purified by silica gel chromatography (petroleum ether:EtOAc 50:50 to petroleum ether:EtOAc:MeOH 40:40:20), giving (19) as a yellow powder (2.1 g, 92%). ¹H NMR (d₆ DMSO) δ 2.07-2.11 (m, 1H, H−2′), 2.31-2.33 (m, 1H, H−2′), 3.49-3.53 (m, 2H, H−5′), 3.77 (br s, 1H, H−4′), 4.25 (d, J=4.3 Hz, 2H, ≡CCH₂), 4.30 (br s, 1H, H−3′), 4.95 (t, J=5.2 Hz, 1H, 5′-OH), 5.25 (d, J=3.4 Hz, 1H, 3′-OH), 6.27-6.31 (m, 1H, H−1′), 6.37 (s, 2H, NH₂), 7.31 (s, 1H, H−8), 10.10 (br s, 1H, NHCOCF₃), 10.55 (s, 1H, NH). Mass (−ve electrospray) calcd for C₁₆H₁₆F₃N₅O₅ 415, found 414.

5′-O-(tert-Butyldiphenylsilyl)-7-deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyguanosine (20)

A solution of (19) (2.4 g, 5.8 mmol) in pyridine (50 ml) was treated with tert-butyldiphenylsilyl chloride (TBDPSCl) (1.65 ml, 6.3 mmol) drop wise at 0° C. The reaction mixture was then warmed to room temperature. After 4 h, another portion of TBDPSCl (260 μL, 1 mmol) was added. The reaction was monitored by TLC, until full consumption of the starting material. The reaction was quenched with MeOH (˜5 ml) and evaporated to dryness. The residue was dissolved in DCM and aq. sat. NaHCO₃ was added. The aqueous layer was extracted with DCM three times. The combined organic extracts were dried (MgSO₄) and concentrated under vacuum. Purification by chromatography on silica (EtOAc to EtOAc:MeOH 85:15) gave (20) as a yellow foam (3.1 g, 82%). ¹H NMR (d₆ DMSO) δ 1.07 (s, 9H, CH₃), 2.19-2.23 (m, 1H, H−2′), 2.38-2.43 (m, 1H, H−2′), 3.73-3.93 (m, 2H, H−5′), 4.29 (d, J=5.0 Hz, 2H, CH₂N), 4.42-4.43 (m, 1H, H−3′), 5.41 (br s, 1H, OH), 6.37 (t, J=6.5 Hz, H−1′), 6.45 (br s, 2H, NH₂), 7.24-7.71 (m, 11H, H−8, H_(Ar)), 10.12 (t, J=3.6 Hz, 1H, NH), 10.62 (s, 1H, H−3). Mass (+ve electrospray) calcd for C₃₂H₃₄F₃N₅O₅Si 653, found 654.

5′-O-(tert-Butyldiphenylsilyl)-7-deaza-3′-O-methylthiomethyl-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyguanosine (21)

A solution of (20) (1.97 g, 3.0 mmol) in DMSO (15 ml) was treated with Ac₂O (8.5 ml, 90 mmol) and AcOH (2.4 ml, 42 mmol) and stirred at room temperature for 15 h, then for 2 h at 40° C. The reaction mixture was diluted with EtOAc (200 ml) and stirred with sat. aq. NaHCO₃ (200 ml) for 1 h. The aqueous layer was extracted with EtOAc twice. The organic layers were combined, dried (MgSO₄) and concentrated under vacuum. Purification by chromatography on silica (EtOAc:Hexane 1:1 to EtOAc:Hexane:MeOH 10:10:1) gave (21) as a yellow foam (1.3 g, 60%). ¹H NMR (CDCl₃) δ 1.04 (s, 9H, CH₃), 2.08 (s, 3H, SCH₃), 2.19-2.35 (m, 2H, H−2′), 3.67-3.71 (m, 2H, H−5′), 3.97-3.99 (m, 2H, H−4′, H−3′), 4.23 (br s, 2H, CH₂N), 4.58 (s, 2H, CH₂S), 6.31 (dd, J=5.7, 7.9 Hz, H−1′), 7.19-7.62 (m, 11H, H−8, H_(Ar)). Mass (+ve electrospray) calcd for C₃₄H₃₈F₃N₅O₅SSi 713, found 714.

3′-O-Azidomethyl-7-deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyguanosine (22)

To a solution of (21) (1.3 g, 1.8 mmol) and cyclohexene (0.91 ml, 9 mmol) in CH₂Cl₂ (10 ml) at 4° C., sulfuryl chloride (1 M in CH₂Cl₂) (1.1 ml, 1.1 mmol) was added dropwise under N₂. After 30 min, TLC indicated the full consumption of the nucleoside (21). After evaporation to remove the solvent, the residue was subjected to high vacuum for 20 min and then treated with NaN₃ (585 mg, 9 mmol) and DMF (10 ml). The resulting suspension was stirred at room temperature for 2 h. Extraction with CH₂Cl₂/NaCl (10%) gave a yellow gum, which was treated with TBAF in THF (1 M, 3 ml) and THF (3 ml) at room temperature for 20 min. Evaporation to remove solvents, extraction with EtOAc/sat. aq. NaHCO₃, followed by purification by chromatography on silica (EtOAc to EtOAc:MeOH 9:1) gave (22) as a yellow foam (420 mg, 50%). ¹H NMR (d₆ DMSO): δ 2.36-2.42 (m, 1H, H−2′), 2.49-2.55 (m, 1H, H−2′), 3.57-3.59 (m, 2H, H−5′), 3.97-4.00 (m, 1H, H−4′), 4.29 (m, 2H, CH₂N), 4.46-4.48 (m, 1H, H−3′), 4.92-4.96 (m, 2H, CH₂N₃), 5.14 (t, J=5.4 Hz, 1H, 5′-OH), 5.96-6.00 (dd, J=5.7, 8.7 Hz, 1H, H−1′), 6.46 (br s, 2H, NH₂), 7.39 (s, 1H, H−6), 10.14 (s, 1H, NH), 10.63 (s, 1H, H−3).
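The quantities in this step can be cross-checked by converting masses to molar amounts. The following is a minimal sketch, reading the stated amounts as 1.3 g of (21) and 585 mg of NaN₃, which is what the quoted mmol values imply. The molar mass of (21) is taken as the nominal mass of 713 quoted above; the NaN₃ value is its standard molar mass; the function names are illustrative only.

```python
# Molar masses in g/mol. The value for (21) is the nominal mass quoted in
# the text; the NaN3 value is the standard molar mass of sodium azide.
MOLAR_MASS = {
    "compound_21": 713.0,
    "NaN3": 65.01,
}


def mmol_from_mg(mass_mg: float, molar_mass: float) -> float:
    """Convert a mass in mg to an amount in mmol (mg / (g/mol) = mmol)."""
    return mass_mg / molar_mass


def equivalents(reagent_mmol: float, substrate_mmol: float) -> float:
    """Molar equivalents of a reagent relative to the substrate."""
    return reagent_mmol / substrate_mmol


substrate_mmol = mmol_from_mg(1300.0, MOLAR_MASS["compound_21"])  # 1.3 g
azide_mmol = mmol_from_mg(585.0, MOLAR_MASS["NaN3"])              # 585 mg
azide_equiv = equivalents(azide_mmol, substrate_mmol)

print(f"(21): {substrate_mmol:.2f} mmol")
print(f"NaN3: {azide_mmol:.2f} mmol ({azide_equiv:.1f} equiv)")
```

Under these assumptions the computed amounts agree with the 1.8 mmol and 9 mmol given in the procedure, corresponding to roughly five equivalents of azide.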

3′-O-Azidomethyl-7-deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyguanosine 5′-O-nucleoside triphosphate (23)

Tetrasodium diphosphate decahydrate (1.5 g, 3.4 mmol) was dissolved in water (34 ml) and the solution was applied to a column of dowex 50 in the H⁺ form. The column was washed with water. The eluent dropped directly into a cooled (ice bath) and stirred solution of tri-n-butyl amine (1.6 ml, 6.8 mmol) in EtOH (14 ml). The column was washed until the pH of the eluent increased to 6. The aqueous ethanol solution was evaporated to dryness and then co-evaporated twice with ethanol and twice with anhydrous DMF. The residue was dissolved in DMF (6.7 ml). The pale yellow solution was stored over 4 Å molecular sieves. The nucleoside (22) and proton sponge were dried over P₂O₅ under vacuum overnight. A solution of (22) (104 mg, 0.22 mmol) and proton sponge (71 mg, 0.33 mmol) in trimethylphosphate (0.4 ml) was stirred with 4 Å molecular sieves for 1 h. Freshly distilled POCl₃ (25 μl, 0.26 mmol) was added and the solution was stirred at 4° C. for 2 h. The mixture was slowly warmed up to room temperature and bis (tri-n-butyl ammonium) pyrophosphate (1.76 ml, 0.88 mmol) and anhydrous tri-n-butyl amine (0.42 ml, 1.76 mmol) were added. After 5 min, the reaction was quenched with 0.1 M TEAB (triethylammonium bicarbonate) buffer (15 ml) and stirred for 3 h. The water was removed under reduced pressure and the resulting residue dissolved in concentrated ammonia (ρ0.88, 10 ml) and stirred at room temperature for 16 h. The reaction mixture was then evaporated to dryness. The residue was dissolved in water and the solution applied to a DEAE-Sephadex A-25 column. MPLC was performed with a linear gradient of 2 L each of 0.05 M and 1 M TEAB. The triphosphate was eluted between 0.7 M and 0.8 M buffer. Fractions containing the product were combined and evaporated to dryness. The residue was dissolved in water and further purified by HPLC. t_(r)(23)=20.5 min (Zorbax C18 preparative column, gradient: 5% to 35% B in 30 min, buffer A 0.1M TEAB, buffer B MeCN). The product was isolated as a white foam (225 O.D., 29.6 μmol, 13.4%, ε₂₆₀=7,600). ¹H NMR (D₂O) δ 2.43-2.5 (m, 2H, H−2′), 3.85 (m, 2H, CH₂N), 3.97-4.07 (m, 2H, H−5′), 4.25 (br s, 1H, H−4′), 4.57 (br s, 1H, H−3′), 4.74-4.78 (m, 2H, CH₂N₃), 6.26-6.29 (m, 1H, H−1′), 7.41 (s, 1H, H−8). ³¹P-NMR (D₂O) δ −8.6 (m, 1P, P_(γ)), −10.1 (d, J=19.4 Hz, 1P, P_(α)), −21.8 (t, J=19.4 Hz, 1P, P_(β)). Mass (−ve electrospray) calcd for C₁₅H₂₁N₈O₁₃P₃ 614, found 613.
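The isolated amount of triphosphate is reported in optical density (O.D.) units, and the conversion to micromoles follows from the Beer-Lambert law. The sketch below assumes that one O.D. unit is the absorbance of the sample in a 1 cm path multiplied by its volume in ml, and that ε₂₆₀ is given in M⁻¹ cm⁻¹; the function names are illustrative and are not part of the original procedure.

```python
def umol_from_od(od_units: float, epsilon: float) -> float:
    """Amount in micromoles from O.D. units (absorbance x ml, 1 cm path).

    A = epsilon * c * l, so c = A / epsilon in mol/L; multiplying by the
    volume in ml gives mmol, and the factor 1000 converts mmol to umol.
    """
    return od_units / epsilon * 1000.0


def percent_yield(product_umol: float, substrate_umol: float) -> float:
    """Isolated yield as a percentage of the limiting substrate."""
    return product_umol / substrate_umol * 100.0


# Values quoted for triphosphate (23): 225 O.D., epsilon(260) = 7600,
# starting from 0.22 mmol (220 umol) of nucleoside (22).
amount_23 = umol_from_od(225.0, 7600.0)
yield_23 = percent_yield(amount_23, 220.0)

print(f"{amount_23:.1f} umol, {yield_23:.1f}% yield")
```

With these inputs the calculation gives about 29.6 μmol and a yield of roughly 13.5%, in line with the reported figures.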

A mixture of the Cy3 disulphide linker (2.5 μmol), 1-(3-dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride (EDC) (0.95 mg, 5 μmol), 1-hydroxybenzotriazole (HOBt) (0.68 mg, 5 μmol) and N-methyl-morpholine (0.55 μL, 5 μmol) in DMF (0.9 ml) was stirred at room temperature for 1 h. A solution of (23) (44 O.D., 3.75 μmol) in 0.1 ml water was added to the reaction mixture at 4° C., and left at room temperature for 3 h. The reaction was quenched with TEAB buffer (0.1M, 10 ml) and loaded on a DEAE Sephadex column (2×5 cm). The column was first eluted with 0.1 M TEAB buffer (100 ml) and then 1 M TEAB buffer (100 ml). The desired triphosphate product was eluted out with 1 M TEAB buffer. The fraction containing the product was concentrated and applied to HPLC. t_(r)(24)=23.8 min (Zorbax C18 preparative column, gradient: 5% to 55% B in 30 min, buffer A 0.1M TEAB, buffer B MeCN). The product was isolated as a red foam (0.5 μmol, 20%, ε_(max)=150,000). ¹H NMR (D₂O) δ 1.17-1.71 (m, 20H, 4×CH₂, 4×CH₃), 2.07-2.15 (m, 1H, H−2′), 2.21-2.30 (m, 1H, H−2′), 2.52-2.58 (m, 2H, CH₂), 2.66-2.68 (m, 2H, CH₂), 2.72-2.76 (m, 2H, CH₂), 3.08-3.19 (m, 2H, CH₂), 3.81-3.93 (m, 6H, CH₂, H−5′), 4.08-4.16 (m, 1H, H−4′), 4.45-4.47 (m, 1H, H−3′), 4.70-4.79 (m, 2H, CH₂N₃), 6.05-6.08 (m, 2H, H_(Ar)), 6.15-6.18 (m, 1H, H−1′), 7.11 (s, 1H, H−8), 7.09-7.18 (m, 2H, CH), 7.63-7.72 (m, 4H, H_(Ar)), 8.27-8.29 (m, 1H, CH). ³¹P NMR (D₂O) δ −4.7 (m, 1P, P_(γ)), −9.8 (m, 1P, P_(α)), −19.7 (m, 1P, P_(β)). Mass (−ve electrospray) calcd for C₅₁H₆₆N₁₁O₂₁P₃S₄ 1389.25, found 1388 (M−H), 694 [M−2H], 462 [M−3H].
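The three ions reported in the negative-mode electrospray spectrum correspond to successive deprotonation of the same molecule. As an illustrative check, the sketch below computes m/z for the singly, doubly and triply deprotonated species from the calculated mass of 1389.25 quoted above. The proton mass of 1.00728 Da is a standard value; the function name is an assumption made for this example.

```python
PROTON_MASS = 1.00728  # Da, standard value


def negative_ion_mz(neutral_mass: float, charge: int) -> float:
    """m/z of the [M - nH]^(n-) ion for a neutral mass and charge n."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return (neutral_mass - charge * PROTON_MASS) / charge


calculated_mass = 1389.25  # calcd value quoted for the Cy3 conjugate
ions = {n: negative_ion_mz(calculated_mass, n) for n in (1, 2, 3)}

for n, mz in ions.items():
    print(f"[M-{n}H]{n}-: m/z {mz:.1f}")
```

Rounded to the nearest integer, the computed values match the observed peaks at 1388, 694 and 462.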

7-Deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyadenosine (25)

To a suspension of 7-deaza-7-iodo-2′-deoxyadenosine (1 g, 2.65 mmol) and CuI (100 mg, 0.53 mmol) in dry DMF (20 ml) was added triethylamine (740 μl, 5.3 mmol). After stirring for 5 min, trifluoro-N-prop-2-ynyl-acetamide (1.2 g, 7.95 mmol) and Pd(PPh₃)₄ (308 mg, 0.26 mmol) were added to the mixture and the reaction was stirred at room temperature in the dark for 16 h. MeOH (40 ml) and bicarbonate dowex were added to the reaction mixture, which was stirred for 45 min. The mixture was filtered. The filtrate was washed with MeOH and the solvent was removed under vacuum. The crude mixture was purified by chromatography on silica (EtOAc to EtOAc:MeOH 95:20) to give (25) as a slightly yellow powder (1.0 g, 95%). ¹H NMR (d₆ DMSO) δ 2.11-2.19 (m, 1H, H−2′), 2.40-2.46 (m, 1H, H−2′), 3.44-3.58 (m, 2H, H−5′), 3.80 (m, 1H, H−4′), 4.29 (m, 3H, H−3′, CH₂N), 5.07 (t, J=5.5 Hz, 1H, OH), 5.26 (d, J=4.0 Hz, 1H, OH), 6.45 (dd, J=6.1, 8.1 Hz, 1H, H−1′), 7.74 (s, 1H, H−8), 8.09 (s, 1H, H−2), 10.09 (t, J=5.3 Hz, 1H, NH).

5′-O-(tert-Butyldiphenylsilyl)-7-deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyadenosine (26)

The nucleoside (25) (1.13 g, 2.82 mmol) was coevaporated twice in dry pyridine (2×10 ml) and dissolved in dry pyridine (18 ml). To this solution was added t-butyldiphenylsilylchloride (748 μl, 2.87 mmol) in small portions at 0° C. The reaction mixture was allowed to warm to room temperature and left stirring overnight. The reaction was quenched with sat. aq. NaCl solution. EtOAc (25 ml) was added to the reaction mixture and the aqueous layer was extracted with EtOAc three times. After drying the combined organic extracts (MgSO₄), the solvent was removed under vacuum. Purification by chromatography on silica (DCM then EtOAc to EtOAc:MeOH 85:15) gave (26) as a slightly yellow powder (1.76 g, 97%). ¹H NMR (d₆ DMSO) δ 1.03 (s, 9H, tBu), 2.25-2.32 (m, 1H, H−2′), 2.06-2.47 (m, 1H, H−2′), 3.71-3.90 (m, 2H, H−5′), 3.90-3.96 (m, 1H, H−4′), 4.32 (m, 2H, CH₂N), 4.46 (m, 1H, H−3′), 5.42 (br s, 1H, OH), 6.53 (t, J=6.7 Hz, 1H, H−1′), 7.38-7.64 (m, 11H, H−8 and H_(Ar)), 8.16 (s, 1H, H−2), 10.12 (t, J=5.3 Hz, 1H, NH).

5′-O-(tert-Butyldiphenylsilyl)-7-deaza-4-N,N-dimethylformamidine-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyadenosine (27)

The nucleoside (26) (831 mg, 1.30 mmol) was dissolved in a mixture of MeOH:N,N-dimethylacetal (30 ml:3 ml) and stirred at 40° C. The reaction, monitored by TLC, was complete after 1 h. The solvent was removed under vacuum. Purification by chromatography on silica (EtOAc:MeOH 95:5) gave (27) as a slightly brown powder (777 mg, 86%). ¹H NMR (d₆ DMSO) δ 0.99 (s, 9H, tBu), 2.22-2.29 (m, 1H, H−2′), 2.50-2.59 (m, 1H, H−2′), 3.13 (s, 3H, CH₃), 3.18 (s, 3H, CH₃), 3.68-3.87 (m, 2H, H−5′), 3.88-3.92 (m, 1H, H−4′), 4.25 (m, 2H, CH₂N), 4.43 (m, 1H, H−3′), 6.56 (t, J=6.6 Hz, 1H, H−1′), 7.36-7.65 (m, 10H, H_(Ar)), 7.71 (s, 1H, H−8), 8.33 (s, 1H, CH), 8.8 (s, 1H, H−2), 10.12 (t, J=5.3 Hz, 1H, NH).

5′-O-(tert-Butyldiphenylsilyl)-7-deaza-4-N,N-dimethylformamidine-3′-O-methylthiomethoxy-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyadenosine (28)

To a solution of (27) (623 mg, 0.89 mmol) in dry DMSO (8 ml) was added acetic acid (775 μl, 13.35 mmol) and acetic anhydride (2.54 ml, 26.7 mmol). The mixture was stirred overnight at room temperature. The reaction was then poured into EtOAc and sat. aq. NaHCO₃ (1:1) solution and stirred vigorously. The organic layer was washed one more time with sat. aq. NaHCO₃ and dried over MgSO₄. After removing the solvent under reduced pressure, the product was purified by chromatography on silica (EtOAc:petroleum ether 1:2, then EtOAc), yielding (28) (350 mg, 52%). ¹H NMR (d₆ DMSO): δ 1.0 (s, 9H, tBu), 2.09 (s, 3H, SCH₃), 2.41-2.48 (m, 1H, H−2′), 2.64-2.72 (m, 1H, H−2′), 3.12 (s, 3H, CH₃), 3.17 (s, 3H, CH₃), 3.66-3.89 (m, 2H, H−5′), 4.04 (m, 1H, H−4′), 4.26 (m, J=5.6 Hz, 2H, CH₂), 4.67 (m, 1H, H−3′), 4.74 (br s, 2H, CH₂), 6.49 (t, J=6.1, 8.1 Hz, 1H, H−1′), 7.37-7.48 (m, 5H, H_(Ar)), 7.58-7.67 (m, 5H, H_(Ar)), 7.76 (s, 1H, H−8), 8.30 (s, 1H, CH), 8.79 (s, 1H, H−2), 10.05 (t, J=5.6 Hz, 1H, NH).

3′-O-Azidomethyl-5′-O-(tert-butyldiphenylsilyl)-7-deaza-4-N,N-dimethylformamidine-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyadenosine (29)

To a solution of (28) (200 mg, 0.26 mmol) and cyclohexene (0.135 ml, 1.3 mmol) in dry CH₂Cl₂ (5 ml) at 0° C., sulfuryl chloride (32 μl, 0.39 mmol) was added under N₂. After 10 min, TLC indicated the full consumption of the nucleoside (28). The solvent was evaporated and the residue was subjected to high vacuum for 20 min. It was then redissolved in dry DMF (3 ml), cooled to 0° C. and treated with NaN₃ (86 mg, 1.3 mmol). The resulting suspension was stirred at room temperature for 3 h. The reaction was partitioned between EtOAc and water. The aqueous phases were extracted with EtOAc. The organic extracts were combined and dried over MgSO₄. After removing the solvent under reduced pressure, the mixture was purified by chromatography on silica (EtOAc), yielding (29) as an oil (155 mg, 80%). ¹H NMR (d₆ DMSO): δ 0.99 (s, 9H, tBu), 2.45-2.50 (m, 1H, H−2′), 2.69-2.78 (m, 1H, H−2′), 3.12 (s, 3H, CH₃), 3.17 (s, 3H, CH₃), 3.67-3.88 (m, 2H, H−5′), 4.06 (m, 1H, H−4′), 4.25 (m, 2H, CH₂), 4.61 (m, 1H, H−3′), 4.84-4.97 (m, 2H, CH₂), 6.58 (t, J=6.6 Hz, 1H, H−1′), 7.35-7.47 (m, 5H, H_(Ar)), 7.58-7.65 (m, 5H, H_(Ar)), 7.77 (s, 1H, H−8), 8.30 (s, 1H, CH), 8.79 (s, 1H, H−2), 10.05 (br s, 1H, NH).

3′-O-Azidomethyl-7-deaza-4-N,N-dimethylformamidine-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyadenosine (30)

A solution of (29) (155 mg, 0.207 mmol) in tetrahydrofuran (THF) (3 ml) was treated with TBAF (1 M in THF, 228 μl) at 0° C. The ice-bath was then removed and the reaction mixture stirred at room temperature. After 2 h, TLC indicated the full consumption of the nucleoside. The solvent was removed. Purification by chromatography on silica (EtOAc:MeOH 95:5) gave (30) (86 mg, 82%) as a pale brown oil. ¹H NMR (d₆ DMSO) δ 2.40-2.48 (dd, J=8.1, 13.6 Hz, 1H, H−2′), 2.59-2.68 (dd, J=8.3, 14 Hz, 1H, H−2′), 3.12 (s, 3H, CH₃), 3.17 (s, 3H, CH₃), 3.52-3.62 (m, 2H, H−5′), 4.02 (m, 1H, H−4′), 4.28 (d, J=5.6 Hz, 2H, CH₂NH), 4.47 (m, 1H, H−3′), 4.89 (s, 2H, CH₂N₃), 5.19 (t, J=5.6 Hz, 1H, OH), 6.49 (dd, J=8.1, 8.7 Hz, 1H, H−1′), 7.88 (s, 1H, H−8), 8.34 (s, 1H, CH), 8.80 (s, 1H, H−2), 10.08 (s, 1H, NH).

7-(3-Aminoprop-1-ynyl)-3′-O-azidomethyl-7-deaza-2′-deoxyadenosine 5′-O-nucleoside triphosphate (31)

The nucleoside (30) and proton sponge were dried over P₂O₅ under vacuum overnight. A solution of (30) (150 mg, 0.294 mmol) and proton sponge (126 mg, 0.588 mmol) in trimethylphosphate (980 μl) was stirred with 4 Å molecular sieves for 1 h. Freshly distilled POCl₃ (36 μl, 0.388 mmol) was added and the solution was stirred at 4° C. for 2 h. The mixture was slowly warmed up to room temperature and bis (tri-n-butyl ammonium) pyrophosphate 0.5 M solution in DMF (2.35 ml, 1.17 mmol) and anhydrous tri-n-butyl amine (560 μl, 2.35 mmol) were added. After 5 min, the reaction was quenched with 0.1 M TEAB (triethylammonium bicarbonate) buffer (15 ml) and stirred for 3 h. The water was removed under reduced pressure and the resulting residue dissolved in concentrated ammonia (ρ0.88, 15 ml) and stirred at room temperature for 16 h. The reaction mixture was then evaporated to dryness. The residue was dissolved in water and the solution applied to a DEAE-Sephadex A-25 column. MPLC was performed with a linear gradient of 0.05 M to 1 M TEAB. Fractions containing the product were combined and evaporated to dryness. The residue was dissolved in water and further purified by HPLC. HPLC: t_(r)(31): 19.94 min (Zorbax C18 preparative column, gradient: 5% to 35% B in 20 min, buffer A 0.1M TEAB, buffer B MeCN). The product (31) was isolated as a white foam (17.5 μmol, 5.9%, ε₂₂₈=15000). ¹H NMR (D₂O) δ 2.67-2.84 (2m, 2H, H−2′), 4.14 (m, 2H, CH₂NH), 4.17-4.36 (m, 2H, H−5′), 4.52 (br s, H−4′), 6.73 (t, J=6.6 Hz, H−1′), 8.06 (s, 1H, H−8), 8.19 (s, 1H, H−2). ³¹P NMR (D₂O) δ −5.07 (d, J=21.8 Hz, 1P, P_(γ)), −10.19 (d, J=19.8 Hz, 1P, P_(α)), −21.32 (t, J=19.8 Hz, 1P, P_(β)). Mass (−ve electrospray) calcd for C₁₅H₂₁N₈O₁₂P₃ 598.05, found 596.

To the Cy3 disulphide linker (1.3 μmol) in DMF (450 μl) was added at 0° C. 50 μl of a mixture of 1-(3-dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride, 1-hydroxybenzotriazole hydrate and N-methylmorpholine (26 μM each) in DMF. The reaction mixture was stirred at room temperature for 1 h. The reaction was monitored by TLC (MeOH:CH₂Cl₂ 3:7) until all the dye linker was consumed. Then DMF (400 μl) was added at 0° C., followed by the nucleotide (31) (1.2 μmol) in water (100 μl), and the reaction mixture was stirred at room temperature overnight. TLC (MeOH:CH₂Cl₂ 4:6) showed complete consumption of the activated ester and a dark red spot appeared on the baseline. The reaction was quenched with TEAB buffer (0.1M, 10 ml) and loaded on a DEAE Sephadex column (2×5 cm). The column was first eluted with 0.1 M TEAB buffer (100 ml) to wash off organic residues and then 1 M TEAB buffer (100 ml). The desired triphosphate (32) was eluted out with 1 M TEAB buffer. The fractions containing the product were combined, evaporated and purified by HPLC. HPLC conditions: t_(r)(32): 22.44 min (Zorbax C18 preparative column, gradient: 5% to 35% B in 20 min, buffer A 0.1M TEAB, buffer B MeCN). The product was isolated as a dark pink solid (0.15 μmol, 12.5%, ε₅₅₀=150000). ¹H NMR (D₂O) δ 2.03 (t, 2H, CH₂), 2.25 (m, 1H, H−2′), 2.43 (m, 1H, H−2′), 2.50 (m, 2H, CH₂), 2.66 (m, 2H, CH₂), 3.79 (m, 2H, CH₂), 3.99 (m, 4H, CH₂N, H−5′), 4.18 (br s, 1H, H−4′), 6.02, 6.17 (2d, J=13.64 Hz, 2H, H_(Ar)), 6.30 (dd, J=6.06, 8.58 Hz, H−1′), 7.08, 7.22 (2d, 2H, 2×═CH), 7.58-7.82 (m, 5H, H_(Ar), H−2, H−8), 8.29 (m, ═CH). ³¹P NMR (D₂O) δ −4.83 (m, 1P, P_(γ)), −10.06 (m, 1P, P_(α)), −20.72 (m, 1P, P_(β)).

Enzyme Incorporation of 3′-Azidomethyl dNTPs

To a 100 nM DNA primer/template (primer previously labelled with ³²P and T4 polynucleotide kinase) in Tris-HCl pH 8.8 50 mM, Tween-20 0.01%, and MgSO₄ 4 mM, add 2 μM compound 6 and 100 nM polymerase (Thermococcus sp. 9° N exo⁻ Y409V A485L supplied by New England Biolabs). The template consists of a run of 10 adenine bases to show the effect of the block. The reaction is heated to 65° C. for 10 min. To show complete blocking, a chase is performed with the four native, unblocked nucleoside triphosphates. Quantitative incorporation of a single azidomethyl blocked dTTP can be observed and thus the azidomethyl group can be seen to act as an effective block to further incorporation.

By attaching a hairpin DNA (covalently attached self-complementary primer/template) to a streptavidin bead, the reaction can be performed over multiple cycles, as shown in FIGS. 5 and 6.

Preparation of the Streptavidin Beads

Remove the storage buffer and wash the beads 3 times with TE buffer (Tris-HCl pH 8, 10 mM and EDTA, 1 mM). Resuspend in B & W buffer (10 mM Tris-HCl pH 7.5, 1 mM EDTA and 2.0 M NaCl) and add biotinylated ³²P-labelled hairpin DNA with the appropriate overhanging template sequence. Allow to stand at room temperature for 15 minutes. Remove the buffer and wash the beads 3 times with TE buffer.

Incorporation of the Fully Functional Nucleoside Triphosphate (FFN)

To a solution of Tris-HCl pH 8.8 50 mM, Tween-20 0.01%, MgSO₄ 4 mM, MnCl₂ 0.4 mM (except cycle 1, 0.2 mM), add 2 μM FFN and 100 nM polymerase. This solution is then added to the beads and mixed thoroughly and incubated at 65° C. for 10-15 minutes. The reaction mixture is removed and the beads washed 3 times with TE buffer.
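The incorporation mix is specified by final concentrations only, so the volume of each stock follows from C₁V₁ = C₂V₂. The following is a hedged sketch of that calculation: the final concentrations are those given above, whereas the stock concentrations and the 50 μl total volume are hypothetical values chosen purely for illustration, and the volume contributed by the beads is ignored.

```python
def stock_volume_ul(final_conc: float, stock_conc: float, total_ul: float) -> float:
    """Volume of stock needed so that it reaches final_conc in total_ul.

    final_conc and stock_conc must share the same unit (C1 * V1 = C2 * V2).
    """
    return final_conc / stock_conc * total_ul


TOTAL_UL = 50.0  # hypothetical reaction volume

# (component, final concentration from the text, HYPOTHETICAL stock, unit)
COMPONENTS = [
    ("Tris-HCl pH 8.8", 50.0, 1000.0, "mM"),
    ("Tween-20", 0.01, 1.0, "%"),
    ("MgSO4", 4.0, 100.0, "mM"),
    ("MnCl2", 0.4, 10.0, "mM"),
    ("FFN", 2.0, 100.0, "uM"),
    ("polymerase", 100.0, 5000.0, "nM"),
]

volumes = {
    name: stock_volume_ul(final, stock, TOTAL_UL)
    for name, final, stock, _unit in COMPONENTS
}
# Water makes up the remainder of the reaction volume.
volumes["water"] = TOTAL_UL - sum(volumes.values())

for name, vol in volumes.items():
    print(f"{name:>16}: {vol:5.2f} ul")
```

For cycle 1, where the text specifies 0.2 mM MnCl₂, the same function would be called with a final concentration of 0.2 in place of 0.4.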

Deblocking Step

Tris-(2-carboxyethyl)phosphine trisodium salt (TCEP) (0.1M) is added to the beads and mixed thoroughly. The mixture is then incubated at 65° C. for 15 minutes. The deblocking solution is removed and the beads washed 3 times with TE buffer.

Capping Step

Iodoacetamide (431 mM) in 0.1 mM phosphate pH 6.5 is added to the beads and mixed thoroughly; the mixture is then left at room temperature for 5 minutes. The capping solution is removed and the beads washed 3 times with TE buffer.

Repeat as Required

The reaction products can be analysed by placing the bead solution in the well of a standard 12% polyacrylamide DNA sequencing gel in 40% formamide loading buffer. Running the gel under denaturing conditions causes the DNA to be released from the beads and onto the gel. The DNA band shifts are affected by both the presence of dye and the addition of extra nucleotides, and thus the cleavage of the dye (and block) with the phosphine causes a mobility shift on the gel.
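The bead-based cycle described above (incorporation, deblocking, capping, each followed by three TE washes) can be written down as a simple schedule. The sketch below is illustrative only: the step names, the data structure and the choice of 15 minutes for incorporation (the upper end of the stated 10-15 minute range) are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temperature: str
    minutes: int
    te_washes: int  # TE buffer washes performed after the step


# One cycle as described in the text; incorporation uses the upper
# bound of the 10-15 min range (an assumption for this sketch).
CYCLE = (
    Step("incorporation (FFN + polymerase)", "65 C", 15, 3),
    Step("deblocking (0.1 M TCEP)", "65 C", 15, 3),
    Step("capping (iodoacetamide)", "room temperature", 5, 3),
)


def schedule(n_cycles: int) -> list:
    """Return (cycle number, step) pairs for n_cycles repeats."""
    return [(c, step) for c in range(1, n_cycles + 1) for step in CYCLE]


def total_incubation_minutes(n_cycles: int) -> int:
    """Total incubation time, excluding wash and handling time."""
    return n_cycles * sum(step.minutes for step in CYCLE)


for cycle, step in schedule(2):
    print(f"cycle {cycle}: {step.name}, {step.temperature}, {step.minutes} min")
print("total incubation:", total_incubation_minutes(2), "min")
```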

Two cycles of incorporation with compounds 18 (C), 24 (G) and 32 (A) and six cycles with compound 6 can be seen in FIG. 5 and FIG. 6.

3′-OH Protected with an Allyl Group:

Nucleotides bearing this blocking group at the 3′ position have been synthesised and shown to be successfully incorporated by DNA polymerases and to block efficiently. The group may subsequently be removed under neutral, aqueous conditions using water-soluble phosphines or thiols, allowing further extension.

5′-O-(t-Butyldimethylsilyl)-5-iodo-2′-deoxyuridine (33)

To a solution of 5-iodo-2′-deoxyuridine (5.0 g, 14 mmol) in dry N,N-dimethylformamide (DMF) (70 ml) was added imidazole (1.09 g, 16 mmol), followed by TBDMSCl (2.41 g, 16 mmol) at 0° C. The mixture was left in the ice bath and stirred overnight. The reaction was quenched with sat. aq. NaCl solution and extracted with EtOAc. After drying (MgSO₄), the solvent was removed and the crude mixture was purified by chromatography on silica (EtOAc:petroleum ether 3:7). The product (33) (5.9 g, 90%) was obtained as a colourless solid. ¹H NMR (d₆ DMSO) δ 0.00 (s, 3H, CH₃), 0.79 (s, 9H, tBu), 1.88-1.97 (m, 1H, H−2′), 2.00-2.05 (m, 1H, H−2′), 3.59-3.71 (m, 2H, H−5′), 3.75 (br s, 1H, H−4′), 4.06 (br s, 1H, H−3′), 5.18 (d, J=4.0 Hz, 1H, OH), 5.98 (t, J=5.9 Hz, 1H, H−1′), 7.89 (s, 1H, H−6), 11.62 (s, 1H, NH). Mass (−ve electrospray) calcd for C₁₅H₂₅IN₂O₅Si 468.06, found 467.

3′-O-Allyl-5′-O-t-butyldimethylsilyl-5-iodo-2′-deoxyuridine (34)

To a suspension of NaH (497 mg, 12.4 mmol, 60% in mineral oil) in dry THF (20 ml), a solution of 5′-TBDMS protected 5-iodo-2′-deoxyuridine (2.8 g, 5.9 mmol) in dry THF (50 ml) was added dropwise. After the gas evolution had stopped, the mixture was stirred for another 10 min and then allyl bromide (561 μl, 6.5 mmol) was added dropwise. After complete addition, the milky reaction mixture was stirred at room temperature for 16 h. The reaction was quenched by addition of sat. aq. NaCl solution (30 ml). The aqueous layer was extracted three times using EtOAc and, after washing with sat. aq. NaCl solution, the organic phase was dried (MgSO₄). After removal of the solvents, the crude product was purified by chromatography (EtOAc:petroleum ether 1:1). The allylated product (2.39 g, 80%) was obtained as a colourless foam. ¹H NMR (d₆ DMSO) δ −0.01 (s, 3H, CH₃), 0.78 (s, 9H, tBu), 1.94-2.01 (m, 1H, H−2′), 2.16-2.21 (m, 1H, H−2′), 3.61-3.71 (m, 2H, H−5′), 3.87-3.94 (m, 4H, H−3′, H−4′, OCH₂), 5.04 (dd, J=1.6, 10.4 Hz, 1H, ═CH₂), 5.15 (dd, J=1.8, 17.3 Hz, 1H, ═CH₂), 5.72-5.81 (m, 1H, CH═), 5.92 (t, J=5.7 Hz, 1H, H−1′), 7.88 (s, 1H, 6-H), 11.6 (s, 1H, NH). Mass (−ve electrospray) calcd for C₁₈H₂₉IN₂O₅Si 508.09, found 507.

3′-O-Allyl-5-iodo-2′-deoxyuridine (35)

To a solution of (34) (2.34 g, 4.71 mmol) in dry THF (40 ml) was added at 0° C. TBAF (5.2 ml, 5.2 mmol, 1 M solution in THF). The reaction mixture was allowed to warm up to room temperature and was then stirred for 16 h. The reaction was quenched by adding sat. NaCl solution (20 ml) and extracted with EtOAc three times. The combined organic layers were dried over MgSO₄. The crude mixture was purified by chromatography on silica (EtOAc:petrol 7:3). Product (35) (1.4 g, 75%) was isolated as a colourless solid. ¹H NMR (d₆ DMSO) δ 2.02-2.39 (m, 2H, H−2′), 3.42-3.52 (m, 2H, H−5′), 3.84-3.88 (m, 3H, H−4′, CH₂), 3.97-4.00 (m, 1H, H−3′), 5.02-5.09 (m, 2H, OH, ═CH₂), (dd, J=1.9, 17.3 Hz, 1H, ═CH₂), 5.73-5.82 (m, 1H, CH═), 5.94 (t, J=6.8 Hz, 1H, H−1′), 8.24 (s, 1H, H−6), 11.56 (s, 1H, NH). Mass (−ve electrospray) calcd for C₁₂H₁₆IN₂O₅ 394.0, found 393.

3′-O-Allyl-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyuridine (36)

To a solution of (35) (400 mg, 1.0 mmol) in dry DMF (10 ml) was added CuI (38 mg, 200 μmol) and triethylamine (300 μl, 2.0 mmol). The propargyltrifluoroacetamide (453 mg, 3.0 mmol) was added dropwise, followed by Pd(PPh₃)₄ (110 mg, 95 μmol). The reaction was stirred for 16 h in the dark. The reaction was quenched by adding MeOH (10 ml), DCM (10 ml) and bicarbonate dowex. The mixture was stirred for 30 min and then filtered. The solvents were removed under vacuum and the crude product was purified by chromatography on silica (EtOAc:petrol 3:7 to 7:3). The product was isolated as slightly yellow crystals (398 mg, 95%). ¹H NMR (d₆ DMSO) δ 2.25-2.43 (m, 2H, H−2′), 3.65-3.76 (m, 2H, H−5′), 4.07-4.17 (m, 3H, H−4′, CH₂), 4.21-4.23 (m, 1H, H−3′), 4.34 (d, J=5.5 Hz, 2H, CH₂N), 5.25-5.27 (m, 2H, ═CH₂, OH), 5.38 (dd, J=1.83, 17.3 Hz, 1H, ═CH₂), 5.96-6.06 (m, 1H, ═CH), 6.17 (t, J=6.9 Hz, 1H, H−1′), 8.29 (s, 1H, H−6), 10.17 (t, J=5.5 Hz, 1H, NHTFA), 11.78 (s, 1H, NH). Mass (−ve electrospray) calcd for C₁₇H₁₈F₃N₃O₆ 417.11, found 416.

3′-O-Allyl-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyuridine 5′-O-nucleoside triphosphate (37)

Under nitrogen, (36) (100 mg, 0.24 mmol) and proton sponge (61.5 mg, 0.28 mmol), both dried under vacuum over P₂O₅ for 24 h, were dissolved in OP(OMe)₃ (225 μl). At 0° C. freshly distilled POCl₃ was added dropwise and the mixture was stirred for 1.5 h. Then pyrophosphate (1.44 ml, 0.72 mmol, 0.5 M in DMF) and nBu₃N (0.36 ml, 1.5 mmol) were added and the resulting mixture stirred for another 1.5 h. Triethylammonium bicarbonate solution (4.5 ml, 0.1 M solution, TEAB) was added and the reaction mixture was left stirring for 2 h. Then aq. NH₃ (4.5 ml) was added and the mixture was stirred for 16 h. After removing the solvents to dryness, the residue was redissolved in water, filtered and purified by MPLC, followed by HPLC purification. The desired triphosphate (37) (10.2 μmol, 4%, ε₂₈₀=10000) was isolated as a colourless foam. MPLC conditions: a gradient was run from 0.05 M TEAB to 0.7 M TEAB using 2 l of each on a DEAE sephadex column. The product-containing fractions came off with ˜0.4 M TEAB. After removing the solvent, the product was HPLC purified. HPLC conditions: t_(r)(triphosphate): 21.9 min (Zorbax C-18 preparative column, buffer A 0.1 M TEAB, buffer B 0.1 M TEAB+30% Acetonitrile, gradient 5-35% buffer B in 35 min). ¹H NMR (D₂O) δ 2.17-2.23 (m, 1H, H−2′), 2.40-2.45 (m, 1H, H−2′), 3.67 (s, 2H, CH₂N), 3.99 (d, J=5.9 Hz, 2H, OCH₂), 4.02-4.17 (m, 2H, H−5′), 4.25 (br s, 1H, H−4′), 4.32-4.33 (m, 1H, H−3′), 5.13 (d, J=10.3 Hz, 1H, ═CH₂), 5.23 (d, J=17.2 Hz, 1H, ═CH₂), 5.78-5.88 (m, 1H, ═CH), 6.16 (t, J=6.7 Hz, 1H, H−1′), 8.33 (s, 1H, H−6). ³¹P NMR (161.9 MHz, D₂O) δ −21.3 (t, J=19.5 Hz, 1P, P_(γ)), −10.3 (d, J=19 Hz, 1P, P_(α)), −7.1 (d, J=15.5 Hz, 1P, P_(β)). Mass (−ve electrospray) calcd for C₁₅H₂₂N₃O₁₄P₃ 561.03, found 560, 480 [M-phosphate], 401 [M−2×phosphate].
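The MPLC conditions above allow a rough estimate of where in the gradient the product elutes. The sketch below assumes an ideal linear gradient formed from 2 l each of 0.05 M and 0.7 M TEAB (4 l in total), so that the buffer concentration rises linearly with eluted volume; real gradients and column dead volumes will deviate from this, and the function names are illustrative.

```python
C_START = 0.05  # M TEAB, from the text
C_END = 0.70    # M TEAB, from the text
TOTAL_L = 4.0   # 2 l of each buffer; ideal linear mixing is assumed


def gradient_concentration(volume_l: float) -> float:
    """TEAB concentration (M) after volume_l litres of an ideal linear gradient."""
    if not 0.0 <= volume_l <= TOTAL_L:
        raise ValueError("volume outside the gradient")
    return C_START + (C_END - C_START) * volume_l / TOTAL_L


def volume_at_concentration(conc_m: float) -> float:
    """Eluted volume (l) at which the gradient reaches conc_m."""
    if not C_START <= conc_m <= C_END:
        raise ValueError("concentration outside the gradient")
    return (conc_m - C_START) / (C_END - C_START) * TOTAL_L


elution_volume = volume_at_concentration(0.4)
print(f"0.4 M TEAB is reached after about {elution_volume:.2f} l")
```

Under these assumptions, a product eluting at about 0.4 M TEAB would come off a little after the midpoint of the gradient, at roughly 2.15 l.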

To a solution of Cy3 disulfide linker (2.5 μmol) in DMF (0.2 ml) at 0° C. were added disuccinimidyl carbonate (0.96 mg, 3.75 μmol) and 4-(dimethylamino)pyridine (DMAP) (0.46 mg, 3.75 μmol). The reaction mixture was stirred for 10 min and then checked by TLC (MeOH:DCM 3:7) (activated ester r_(f)=0.5). In a separate flask the 3′-O-allyl thymidine triphosphate (37) (532 μl, 14.1 mM in water, 7.5 μmol) was mixed with Bu₃N (143 μl) and evaporated to dryness. After this the triphosphate (37) was dissolved in dry DMF (0.2 ml). To the triphosphate (37) solution at 0° C. was added the activated dye and the reaction mixture was allowed to warm to room temperature and then stirred for 16 h. The solvent was removed and the residue was dissolved in water. The reaction mixture was passed through a small DEAE sephadex column (2×5 cm) using 0.1 M TEAB (100 ml) to remove the coupling reagents and unreacted linker. With 1 M TEAB (100 ml) the triphosphate (38) was eluted. The mixture was then separated by HPLC. Yield: 1.41 μmol (56%, ε₅₅₀=150000) of product was isolated as a dark red solid. HPLC conditions: t_(r) (38): 19.6 min (Zorbax C-18 preparative column, buffer A 0.1 M TEAB, buffer B Acetonitrile, gradient: 2-58% buffer B in 29 min). ¹H NMR (d₆ DMSO) δ 0.75-0.79 (m, 3H, CH₃), 1.17-1.28 (m, 2H, CH₂), 1.48-1.55 (m, 2H, CH₂), 1.64 (s, 12H, 4×CH₃), 1.70-1.77 (m, 2H, CH₂), 1.96-2.02 (m, 1H, H−2′), 2.07-2.11 (m, 2H, CH₂), 2.25-2.30 (m, 1H, H−2′), 2.51-2.55 (m, 2H, CH₂), 2.64-2.68 (m, 2H, CH₂), 2.75-2.81 (m, 2H, CH₂), 3.27-3.31 (m, 2H, CH₂), 3.91-4.05 (m, 9H, H−5′, OCH₂, NCH₂, 2×NCH₂-dye), 4.13 (s, 1H, H−4′), 4.22-4.24 (m, 1H, H−3′), 5.06 (d, J=10.5 Hz, 1H, ═CH₂), 5.15 (dd, J=1.4 Hz, 17.3 Hz, 1H, ═CH₂), 5.72-5.82 (m, 1H, ═CH), 6.03-6.06 (m, 1H, H−1′), 6.20-6.29 (m, 2H, αH), 7.23-7.31 (m, 2H, H_(Ar)), 7.63-7.79 (m, 5H, H−6, 4×H_(Ar)), 8.31-8.45 (m, 1H, βH). ³¹P NMR (161.9 MHz, d₆ DMSO) δ −20.2 (m, 1P, P_(β)), −10.0 (d, J=18.5 Hz, 1P, P_(α)), −4.8 (d, J=19.5 Hz, 1P, P_(γ)). Mass (−ve electrospray) calcd for C₅₁H₆₇S₄N₆O₂₂P₃ 1336.24, found 1335.1, 688.1 [cleaved disulfide (dye)], 647.9 [cleaved disulfide (nucleotide)].
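The stoichiometry of this coupling can be checked from the solution volume and concentration of the triphosphate. The following is a minimal sketch using only figures quoted in the procedure (532 μl of a 14.1 mM solution, 2.5 μmol of dye linker, 1.41 μmol of product); the helper names are illustrative, and treating the dye linker as the limiting reagent is an inference from the amounts rather than a statement in the text.

```python
def amount_umol(volume_ul: float, conc_mm: float) -> float:
    """Amount in umol from a volume in ul and a concentration in mM.

    ul x mmol/l = nmol, so dividing by 1000 gives umol.
    """
    return volume_ul * conc_mm / 1000.0


triphosphate_umol = amount_umol(532.0, 14.1)  # compound (37)
dye_linker_umol = 2.5                         # Cy3 disulfide linker
product_umol = 1.41                           # isolated (38)

# The reagent present in the smaller molar amount limits the yield.
limiting_umol = min(triphosphate_umol, dye_linker_umol)
excess_equiv = triphosphate_umol / dye_linker_umol
yield_percent = product_umol / limiting_umol * 100.0

print(f"triphosphate: {triphosphate_umol:.2f} umol ({excess_equiv:.1f} equiv)")
print(f"yield based on dye linker: {yield_percent:.0f}%")
```

On these figures the triphosphate is used in about a threefold excess, and the 56% yield is consistent with a calculation based on the dye linker.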

Enzyme Incorporation of Compound 38

To a 100 nM DNA primer/template (primer previously labelled with ³²P and T4 polynucleotide kinase) in Tris-HCl pH 8.8 50 mM, Tween-20 0.01%, and MgSO₄ 4 mM, add 2 μM compound 38 and 100 nM polymerase (Thermococcus sp. 9° N exo⁻ Y409V A485L supplied by New England Biolabs). The template consists of a run of 10 adenine bases to show the effect of the block. The reaction is heated to 65° C. for 10 min. To show complete blocking, a chase is performed with the four native, unblocked nucleoside triphosphates. Quantitative incorporation of the allyl block can be observed (see FIG. 7) and this can be seen to act as an effective block to further incorporation.

5′-O-(tert-Butyldimethylsilyl)-5-iodo-2′-deoxycytidine (39)

To a solution of 5-iodo-2′-deoxycytidine (2.2 g, 6.23 mmol) in DMF (130 ml) was added imidazole (467 mg, 6.85 mmol). The mixture was cooled at 0° C. and tert-butyldimethylsilyl chloride (TBDMSCl) (1.33 g, 6.85 mmol) added over 5 minutes. After 18 h at room temperature, the volatiles were evaporated under reduced pressure and the residue purified by flash chromatography on silica gel with EtOAc:MeOH (95:5 to 90:10) to give the expected product (39) (2.10 g, 72%) together with unreacted starting material (490 mg). ¹H NMR (d₆ DMSO) δ 0.11 (s, 3H, CH₃), 0.12 (s, 3H, CH₃), 0.89 (s, 9H, 3CH₃), 1.90 (ddd, J=13.2, 7.7 and 5.7 Hz, 1H, HH-2′), 2.18 (ddd, J=13.2, 5.7 and 2.3 Hz, 1H, HH-2′), 3.72 (dd, J=11.5, 3.6 Hz, 1H, HH-5′), 3.80 (dd, J=11.5, 2.8 Hz, 1H, HH-5′), 3.86-3.89 (m, 1H, H−4′), 4.14-4.18 (m, 1H, H−3′), 5.22 (1H, d, J=4.1 Hz, OH), 6.09 (1H, dd, J=7.8, 5.8 Hz, H−1′), 6.60 (br s, 1H, NHH), 7.81 (br s, 1H, NHH), 7.94 (s, 1H, H−6); MS (ES): m/z (%) (M+H) 468 (90%).

3′-O-Allyl-5′-O-(tert-butyldimethylsilyl)-5-iodo-2′-deoxycytidine (40)

To a suspension of NaH (60%, 113 mg, 2.84 mmol) in THF (26 ml) under N₂ atmosphere was slowly added a solution of the starting nucleoside (39) (669 mg, 1.43 mmol) in THF (6 ml). The mixture was stirred at room temperature for 45 minutes and cooled to 0° C., and allyl bromide (134 μL, 1.58 mmol) was slowly added. After 15 h at room temperature, the solution was cooled to 0° C. and quenched by addition of H₂O (5 ml). THF was evaporated under reduced pressure and the product extracted into EtOAc (3×25 ml). The combined organic extracts were dried (MgSO₄) and filtered, and the volatiles evaporated under reduced pressure to give a residue that was purified by flash chromatography on silica gel with EtOAc, affording the expected 3′-O-allyl product (40) (323 mg, 44%) as a colourless oil, together with some unreacted starting material (170 mg). ¹H NMR (d₆ DMSO) δ 0.00 (s, 3H, CH₃), 0.01 (s, 3H, CH₃), 0.79 (s, 9H, 3CH₃), 1.84 (ddd, J=13.3, 8.2 and 5.5 Hz, 1H, H−2′), 2.20-2.25 (m, 1H, H−2′), 3.62-3.72 (m, 2H, H−5′), 3.88-3.93 (m, 4H, H−3′,4′, HHC—CH═), 5.1 (dd, J=8.5, 1.7 Hz, 1H, CH═CHH), 5.16 (dd, J=17.2, 1.7 Hz, 1H, CH═CHH), 5.75-5.83 (m, 1H, CH═CHH), 5.94 (dd, J=8.4, 5.6 Hz, 1H, H−1′), 6.53 (br s, 1H, NHH), 7.74 (br s, 1H, NHH), 7.83 (s, 1H, H−6); MS (ES): m/z (%) (M−H) 506 (100%).

3′-O-Allyl-5-iodo-2′-deoxycytidine (41)

To a solution of the starting nucleoside (40) (323 mg, 0.64 mmol) in THF (15 ml) under an N₂ atmosphere was added at room temperature tetrabutylammonium fluoride (TBAF) 1M in THF (0.7 ml, 0.7 mmol). The mixture was stirred for one hour and then quenched by addition of H₂O (5 ml). THF was evaporated and the aqueous residue extracted into EtOAc (3×25 ml). The combined organic extracts were dried (MgSO₄) and filtered, and the volatiles evaporated under reduced pressure, giving a crude material which was purified by flash chromatography on a pre-packed silica column eluted with EtOAc. The product (41) was obtained as a white solid (233 mg, 93%). ¹H NMR (d₆ DMSO) δ 1.96-2.05 (m, 1H, H−2′), 2.24 (ddd, J=13.5, 5.8 and 2.8 Hz, 1H, H−2′), 3.50-3.62 (m, 2H, H−5′), 3.91-3.97 (m, 2H, H−3′, H−4′), 4.03-4.07 (m, 2H, HHC—CH═), 5.11-5.16 (m, 2H, OH, CH═CHH), 5.24 (dd, J=17.2, 1.6 Hz, 1H, CH═CHH), 5.82-5.91 (m, 1H, CH═CHH), 6.02 (dd, J=7.6, 6.0 Hz, 1H, H−1′), 6.60 (s, 1H, NHH), 7.79 (s, 1H, NHH), 8.21 (s, 1H, H−6). MS (ES): m/z (%) (M−H) 392 (100%).

3′-O-Allyl-5-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxycytidine (42)

To a solution of the starting nucleoside (41) (200 mg, 0.51 mmol) in dry DMF (8.5 ml) at room temperature under an argon atmosphere were slowly added CuI (19 mg, 0.10 mmol), NEt₃ (148 μl, 1.02 mmol), 2,2,2-trifluoro-N-prop-2-ynyl-acetamide (230 mg, 1.53 mmol) and Pd(PPh₃)₄ (58 mg, 0.05 mmol). The mixture was stirred at room temperature, protected from light, for four hours, quenched by addition of Dowex bicarbonate and stirred for 1 h, then filtered and the volatiles evaporated under reduced pressure. The residue was further evaporated from MeOH (15 ml) and then purified by flash chromatography on silica gel (CH₂Cl₂, CH₂Cl₂:EtOAc 1:1, EtOAc:MeOH 97.5:2.5). The expected product (42) was obtained as a beige solid (180 mg, 85%). ¹H NMR (d₆ DMSO) δ 1.90 (ddd, J=13.6, 7.7 and 6.0 Hz, 1H, H−2′), 2.16 (ddd, J=13.6, 5.7 and 2.4 Hz, 1H, H−2′), 3.42-3.50 (m, 2H, H−5′), 3.84-3.87 (m, 3H, H−4′, OHHC—CH═), 3.94-3.96 (m, 1H, H−3′), 4.16 (d, J=5.1 Hz, 2H, H₂C—N), 4.98-5.05 (m, 2H, OH, CH═CHH), 5.14 (dd, J=17.3, 1.7 Hz, 1H, CH═CHH), 5.72-5.82 (m, 1H, CH═CHH), 5.95 (dd, J=7.7, 5.8 Hz, 1H, H−1′), 6.74 (br s, 1H, NHH), 7.72 (br s, 1H, NHH), 8.01 (s, 1H, H−6), 9.82 (br t, 1H, HN—CH₂). MS (ES): m/z (%) (M−H) 415 (100%).

3′-O-Allyl-5-(3-amino-prop-1-ynyl)-5′-O-triphosphate-2′-deoxycytidine (43)

To a solution of the nucleoside (42) (170 mg, 0.41 mmol) and proton sponge (105 mg, 0.50 mmol) (both previously dried over P₂O₅ for at least 24 h) in PO(OMe)₃ (360 μl), at 0° C. under an argon atmosphere, was slowly added POCl₃ (freshly distilled) (50 μl, 0.54 mmol). The solution was vigorously stirred for 3 h at 0° C. and then quenched by addition of tetra-tributylammonium diphosphate 0.5 M in DMF (3.20 ml, 1.60 mmol), followed by nBu₃N (0.75 ml, 3.2 mmol) and triethylammonium bicarbonate (TEAB) 0.1 M (12 ml). The mixture was stirred at room temperature for 3 h and then an aqueous ammonia solution (ρ0.88 1.0 ml) (12 ml) was added. The solution was stirred at room temperature for 15 h, the volatiles were evaporated under reduced pressure and the residue was purified by MPLC with a gradient of TEAB from 0.05M to 0.7M. The expected triphosphate (43) was eluted from the column at approx. 0.51 M TEAB. A second purification was done by HPLC on a Zorbax SB-C18 column (21.2 mm i.d.×25 cm) eluted with 0.1M TEAB (pump A) and 30% CH₃CN in 0.1M TEAB (pump B) using a gradient as follows: 0-5 min 5% B, Φ0.2 ml; 5-25 min 80% B, Φ0.8 ml; 25-27 min 95% B, Φ0.8 ml; 27-30 min 95% B, Φ0.8 ml; 30-32 min 5% B, Φ0.8 ml; 32-35 min 95% B, Φ0.2 ml, affording the product (43) detailed above with a t_(r)(43): 20.5 (20 μmol, 5% yield); ³¹P NMR (D₂O) δ −6.01 (d, J=19.9 Hz, 1P, P_(γ)), −10.24 (d, J=19.3 Hz, 1P, P_(α)), −21.00 (t, J=19.6 Hz, 1P, P_(β)); ¹H NMR (D₂O) δ 2.19-2.26 (m, 1H, H−2′), 2.51 (ddd, J=14.2, 6.1 and 3.2 Hz, 1H, H−2′), 3.96-4.07 (m, 4H, NCH₂, OHHC—CH═), 4.09-4.14 (m, 1H, H−5′), 4.22-4.26 (m, 1H, H−5′), 4.30-4.37 (m, 2H, H−3′, 4′), 5.20 (d, J=10.4 Hz, 1H, CH═CHH), 5.30 (dd, J=17.3, 1.5 Hz, 1H, CH═CHH), 5.85-5.95 (m, 1H, CH═CHH), 6.18 (t, J=6.5 Hz, 1H, H−1′), 8.40 (s, 1H, H−6); MS (ES): m/z (%) (M−H) 559 (100%).

To a solution of Alexa Fluor 488 disulfide linker (2.37 mg, 3.4 μmol) in DMF (500 μl) was added N,N-disuccinimidyl carbonate (1.3 mg, 5.1 μmol) and 4-DMAP (0.6 mg, 5.1 μmol). The mixture was stirred for 10 minutes, then added to a solution of the nucleotide (43) (3.23 mg, 5.8 μmol) in DMF (100 μl) containing nBu₃N (30 μl). The mixture was continuously stirred for 16 h at room temperature. The volatiles were evaporated under reduced pressure and the residue was first purified by passing it through a short ion exchange resin Sephadex-DEAE A-25 (40-120μ) column, eluted first with 0.1 M TEAB (70 ml) and then with 1.0 M TEAB (100 ml). The latter fraction, containing the expected product (44), was concentrated and the residue was purified by HPLC on a Zorbax SB-C18 column (21.2 mm i.d.×25 cm) eluted with 0.1M TEAB (pump A) and CH₃CN (pump B) using a gradient as follows: 0-2 min 2% B, Φ0.2 ml; 2-4 min 2% B, Φ0.8 ml; 4-15 min 23% B, Φ0.8 ml; 15-24 min 23% B, Φ0.8 ml; 24-26 min 95% B, Φ0.8 ml; 26-28 min 95% B, Φ0.8 ml; 28-30 min 2% B, Φ0.8 ml; 30-33 min 2% B, Φ0.2 ml, affording the product detailed above with a t_(r)(44): 19.9 (0.56 μmol, 17% yield based on UV measurement; λ_(max)=493 nm, ε 71,000 cm⁻¹ M⁻¹ in H₂O); ³¹P NMR (D₂O) δ −5.07 (d, J=22.2 Hz, 1P, P_(γ)), −10.26 (d, J=19.4 Hz, 1P, P_(α)), −21.09 (t, J=19.7 Hz, 1P, P_(β)); ¹H NMR (D₂O) δ 2.44-2.26 (m, 2H, HH-2′), 2.50 (t, J=6.7 Hz, 2H, CH₂), 2.83 (4H, CH₂, CH₂), 3.58 (t, J=6.0 Hz, 2H, CH₂), 4.07-3.91 (m, 6H, HH-5′, NCH₂, OHHC—CH═), 4.16-4.12 (m, 1H, H−4′), 4.23-4.17 (m, 1H, H−3′), 5.24-5.09 (m, 2H, CH═CHH, CH═CHH), 5.84-5.74 (m, 1H, CH═CHH), 5.98 (t, J=8.1 Hz, 1H, H−1′), 6.79 (d, J=9.1 Hz, 1H, H_(Ar)), 6.80 (d, J=9.3 Hz, 1H, H_(Ar)), 7.06 (t, J=8.8 Hz, 2H, H_(Ar)), 7.55 (br s, 1H, H_(Ar)), 7.90-7.85 (m, 2H, H_(Ar)), 7.94 (s, 1H, H−6); MS (ES): m/z (%) (M−H)⁻ 1239 (27%).

5′-O-(tert-Butyldimethylsilyl)-7-deaza-7-iodo-2′-deoxyguanosine (45)

A solution of (44) (0.55 g, 1.4 mmol) in DMF (10 ml) was treated with imidazole (190 mg, 2.8 mmol) and TBDMSCl (274 mg, 1.82 mmol) at r.t. for 15 h. The reaction was quenched with MeOH (˜5 ml). The mixture was evaporated to dryness. Water (˜300 ml) was added to the residue and stirred for at least 1 h to fully dissolve imidazole. Filtration gave a brown solid, which was dried and purified by silica gel chromatography (DCM to DCM: MeOH 90:10), giving (45) as pale yellow powder (394 mg , 56%). ¹H NMR (d₆ DMSO) δ 0.00, 0.01 (2s, 6H, CH₃), 0.82 (s, 9H, CH₃), 1.99-2.05, 2.16-2.22 (2m, 2H, H−2′), 3.58-3.66 (m, 2H, H−5′), 3.72-3.74 (m, 1H, H−4′), 4.18-4.19 (m, 1H, H−3′), 5.16 (d, J=3.0 Hz, 1H, OH), 6.20 (dd, J=6.0, 8.0 Hz, 1H, H−1′), 6.25 (br s, 2H, NH₂), 7.58 (s, 1H, H−8), 10.37 (s, 1H, HN). Mass (−ve electrospray) calcd for C₁₇H₂₇IN₄O₄Si 506, found 505.

3′-O-Allyl-5′-O-(tert-butyldimethylsilyl)-7-deaza-7-iodo-2′-deoxyguanosine (46)

A solution of (45) (354 mg, 0.7 mmol) in THF (25 ml) was treated with NaH (42 mg, 1.75 mmol) at r.t. for 1 h. Allyl bromide was added and the suspension was stirred at r.t. for 2 days. Approximately 60% of the starting material (45) was converted to the product (46). The reaction was quenched with sat. aq. NaCl and extracted with DCM three times. The combined organic layers were dried (MgSO₄) and concentrated under vacuum. The residue was treated with TBAF in THF (1 ml) and THF (1 ml) for 30 min. The THF was removed by evaporation. The residue was dissolved in DCM and aqueous NaHCO₃ (sat.) was added. The aqueous layer was extracted with DCM three times. The combined organics were dried over MgSO₄ and concentrated under vacuum. Purification by chromatography on silica (EtOAc to EtOAc:MeOH 85:15) gave (46) as a yellow foam (101 mg, 35%). ¹H NMR (d₆ DMSO) δ 2.15-2.31 (m, 2H, H−2′), 3.41-3.45 (m, 2H, H−5′), 3.82-3.85 (m, 1H, H−4′), 3.93 (d, J=2.6 Hz, 2H, OCH₂), 4.04-4.06 (m, 1H, H−3′), 4.99 (t, J=5.4 Hz, OH), 5.08-5.24 (m, 2H, ═CH₂), 5.79-5.89 (m, 1H, CH═), 6.15 (dd, J=5.9, 9.1 Hz, 1H, H−1′), 6.27 (br s, 2H, NH₂), 7.07 (s, H−8), 10.39 (s, 1H, NH). Mass (−ve electrospray) calcd for C₁₄H₁₇IN₄O₄ 432, found 431.

3′-O-Allyl-5′-O-(tert-butyldimethylsilyl)-7-deaza-7-[3-(2,2,2-trifluoroacetamido)-prop-1-ynyl]-2′-deoxyguanosine (47)

Under N₂, a suspension of (46) (104 mg, 0.24 mmol), Pd(PPh₃)₄ (24 mg, 0.024 mmol), CuI (9.1 mg, 0.048 mmol), Et₃N (66 μL, 0.48 mmol) and CH≡CCH₂NHCOCF₃ (89 μL, 0.72 mmol) in DMF (2 ml) was stirred at r.t. for 15 h. The reaction was protected from light with aluminium foil. After TLC indicated full consumption of the starting material, the reaction mixture was concentrated. The residue was diluted with MeOH (20 ml) and treated with Dowex-HCO₃⁻. The mixture was stirred for 30 min and filtered. The solution was concentrated and purified by silica gel chromatography (petroleum ether:EtOAc 50:50 to petroleum ether:EtOAc:MeOH 40:40:20) giving (47) as a yellow powder (74 mg, 70%). ¹H NMR (d₆ DMSO) δ 2.15-2.39 (m, 2H, H−2′), 3.42-3.44 (m, 2H, H−5′), 3.83-3.87 (m, 1H, H−4′), 3.93-3.95 (m, 2H, OCH₂), 4.0-4.07 (m, 1H, H−3′), 4.15 (d, J=5.3 Hz, 2H, ≡CCH₂), 4.91 (t, J=5.4 Hz, OH), 5.08-5.24 (m, 2H, ═CH₂), 5.80-5.89 (m, 1H, CH═), 6.15 (dd, J=5.6, 8.9 Hz, 1H, H−1′), 6.28 (br s, 2H, NH₂), 7.24 (s, H−8), 9.98 (t, J=5.3 Hz, 1H, NH), 10.44 (s, 1H, NH). Mass (−ve electrospray) calcd for C₁₉H₂₀F₃N₅O₅ 455, found 454.

The nucleoside (47) and proton sponge were dried over P₂O₅ under vacuum overnight. A solution of (47) (73 mg, 0.16 mmol) and proton sponge (69 mg, 0.32 mmol) in trimethylphosphate (0.5 ml) was stirred with 4 Å molecular sieves for 1 h. Freshly distilled POCl₃ (18 μl, 0.19 mmol) was added and the solution was stirred at 4° C. for 2 h. The mixture was slowly warmed to room temperature and bis (tri-n-butyl ammonium) pyrophosphate (1.3 ml, 0.88 mmol) and anhydrous tri-n-butyl amine (0.3 ml, 1.28 mmol) were added. After 5 min, the reaction was quenched with 0.1 M TEAB (triethylammonium bicarbonate) buffer (10 ml) and stirred for 3 h. The water was removed under reduced pressure and the resulting residue was dissolved in concentrated ammonia (ρ0.88, 10 ml) and stirred at room temperature for 16 h. The reaction mixture was then evaporated to dryness. The residue was dissolved in water and the solution applied to a DEAE-Sephadex A-25 column. MPLC was performed with a linear gradient of 2 L each of 0.05 M and 1 M TEAB. The triphosphate was eluted between 0.7 M and 0.8 M buffer. Fractions containing the product were combined and evaporated to dryness. The residue was dissolved in water and further purified by HPLC. t_(r)(48)=20.3 min (Zorbax C18 preparative column, gradient: 5% to 35% B in 30 min, buffer A 0.1 M TEAB, buffer B MeCN). The product (48) was isolated as a white foam (147 O.D., 19.3 μmol, 12%, ε₂₆₀=7,600). ¹H NMR (D₂O) δ 2.38-2.46 (m, 2H, H−2′), 3.91 (m, 2H, ≡CCH₂), 3.98-4.07 (m, 4H, H−5′, 2H, OCH₂), 4.25 (br s, 1H, H−4′), 4.40 (br s, 1H, H−3′), 5.16-5.30 (m, 1H, ═CH₂), 5.83-5.91 (m, 1H, ═CH), 6.23-6.27 (m, 1H, H−1′), 7.44 (s, 1H, H−8). ³¹P NMR δ −7.1 (d, J=16.5 Hz, 1P, P_(γ)), −10.1 (d, J=19.9 Hz, 1P, P_(α)), −21.5 (t, J=18.0 Hz, 1P, P_(β)). Mass (−ve electrospray) calcd for C₁₇H₂₄N₅O₁₃P₃ 599, found 598.

7-Deaza-5′-O-diphenylsilyl-7-iodo-2′-deoxyadenosine (49)

TBDPSCl (0.87 g, 2.78 mmol) was added to a stirred solution of 7-deaza-7-iodo-2′-deoxyadenosine (1.05 g, 2.78 mmol) in dry pyridine (19 ml) at 5° C. under N₂. After 10 min the solution was allowed to rise to room temperature and stirred for 18 h. The solution was evaporated under reduced pressure and the residue purified by flash chromatography on silica (DCM to DCM:MeOH 19:1). This gave the desired product (49) (1.6 g, 83%). ¹H NMR (d₆ DMSO) δ 1.07 (s, 9H), 2.31-2.36 (m, 1H), 3.76-3.80 (dd, 1H, J=11.1, 4.7 Hz), 3.88-3.92 (dd, 1H, J=11.2, 3.9 Hz), 3.97-4.00 (m, 1H), 4.49-4.50 (m, 1H), 5.83 (s, 1H), 6.58-6.61 (t, 1H, J=6.7 Hz), 7.44-7.55 (m, 6H), 7.68-7.70 (m, 5H), 8.28 (s, 1H). Mass (electrospray) calcd for C₂₇H₃₁IN₄O₃Si 614.12, found 613.

7-Deaza-6-N,N-dimethylformadine-5′-O-diphenylsilyl-7-iodo-2′-deoxyadenosine (50)

A solution of (49) (1.6g, 2.61 mmol) in MeOH (70 ml) containing dimethylformamide dimethylacetal (6.3 g, 53 mmol) was heated at 45° C. for 18 h. The solution was cooled, evaporated under reduced pressure and purified by flash chromatography on silica gel (EtOAc to EtOAc:MeOH 98:2). This resulted in 1.52 g (87%) of the desired product (50). ¹H NMR (d₆ DMSO) δ 0.85 (s, 9H), 2.05-2.11 (m, 1H), 3.03 (s, 3H), 3.06 (s, 3H), 3.53-3.57 (dd, 1H, J=11.1, 4.8 Hz), 3.65-3.69 (dd, 1H, J=11.1, 4 Hz), 3.73-3.76 (q, 1H, J=4 Hz), 4.26-4.28 (m, 1H), 5.21-5.22 (d, 1H, J=4.3 Hz), 6.39-6.42 (t, 1H, J=6.8 Hz), 7.21-7.32 (m, 6H), 7.46 (s, 1H), 7.45-7.48 (m, 4H), 8.15 (s, 1H), 8.68 (s, 1H). Mass (+ve electrospray) calcd for C₃₀H₃₆IN₅O₃Si 669.16, found 670.

3′-O-Allyl-7-deaza-6-N,N-dimethylformadine-5′-O-diphenylsilyl-7-iodo-2′-deoxyadenosine (51)

A solution of (50) (1.52 g, 2.28 mmol) in dry THF (5 ml) was added drop wise at room temperature to a stirred suspension of sodium hydride (60%, 109 mg, 2.73 mmol) in dry THF (35 ml). After 45 min the yellow solution was cooled to 5° C. and allyl bromide (0.413 g, 3.41 mmol) added. The solution was allowed to rise to room temperature and stirred for 18 h. After adding isopropanol (10 drops) the solution was partitioned between water (5 ml) and EtOAc (50 ml). The organic layer was separated and the aqueous solution extracted further with EtOAc (2×50 ml). The combined organic solutions were dried (MgSO₄) and evaporated under reduced pressure. The residue was purified by flash chromatography on silica (petroleum ether:EtOAc 1:3 to EtOAc) to give 1.2 g (74%) of the desired product (51) as a gum. ¹H NMR (d₆DMSO) δ 1.03 (s, 9H), 2.39-2.45 (m, 1H), 2.60-2.67 (m, 1H), 3.2 (s, 3H), 3.23 (s, 3H), 3.70-3.74 (dd, 1H, J=11.2, 4.6 Hz), 3.83-3.87 (dd, 1H, J=11, 5.4 Hz), 4.03-4.08 (m, 3H), 4.30-4.31 (m, 1H), 5.18-5.21 (m, 1H), 5.28-5.33 (m, 1H), 5.89-5.98 (m, 1H), 6.49-6.53 (dd, 1H, J=8.4, 5.8 Hz), 7.41-7.51 (m, 6H), 7.62-7.66 (m, 5H), 8.31 (s, 1H), 8.85 (s, 1H). Mass (+ve electrospray) calcd for C₃₃H₄₀IN₅O₃Si 709.19, found 710.

3′-O-Allyl-7-deaza-6-N,N-dimethylformadine-7-iodo-2′-deoxyadenosine (52)

A 1M solution of TBAF in THF (4.4 ml, 4.4 mmol) was added to a solution of (51) (1.2 g, 1.69 mmol) in THF (100 ml) at 50° C. under N₂. The solution was allowed to rise to room temperature and stirred for 2 d. The solution was evaporated under reduced pressure and purified by flash chromatography on silica (EtOAc to EtOAc:MeOH 97:3). This gave 593 mg (77%) of the desired product (52). ¹H NMR (d₆DMSO) δ 2.54 (m, 2H), 3.40 (s, 3H), 3.44 (s, 3H), 3.72-3.8 (m, 2H), 4.18-4.21 (m, 1H), 4.23-4.27 (m, 3H), 4.4-4.42 (d, 1H, J=5.7 Hz), 5.35-5.41 (m, 2H), 5.49-5.5 (q, 1H, J=1.7 Hz), 5.53-5.55 (q, 1H, J=1.7 Hz), 6.1-6.2 (m, 1H), 6.67-6.70 (dd, 1H, J=8.8, 5.5 Hz), 7.96 (s, 1H), 8.53 (s, 1H), 9.06 (s, 1H). Mass (+ve electrospray) calcd for C₁₇H₂₂IN₅O₃ 471.08, found 472.

3′-O-Allyl-7-deaza-7-iodo-2′-deoxyadenosine (53)

A solution of (52) (593 mg, 1.3 mmol) in MeOH (20 ml) containing 35% aqueous ammonia (20 ml) was heated at 50° C. for 2 d. After cooling the solution was evaporated under reduced pressure and then azeotroped with toluene (3×10 ml). This resulted in 530 mg (98%) of the desired product (53) as a solid. ¹H NMR (d₆ DMSO) δ 2.39 (m, 1H), 3.56-3.65 (m, 2H), 4.03-4.05 (m, 1H), 4.09-4.11 (m, 2H), 5.23-5.25 (d, 1H, J=10.6 Hz), 5.35-5.4 (d, 1H, J=15.4 Hz), 5.95-6.05 (m, 1H), 6.48-6.51 (dd, 1H, J=8.9, 5.5 Hz), 6.6-6.95 (s, 1H), 7.75 (s, 1H), 8.16 (s, 1H). Mass (+ve electrospray) calcd for C₁₄H₁₇IN₄O₃ 416.03, found 417.
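
The calculated mass quoted for (53) can be reproduced from its molecular formula. The sketch below is illustrative only: it assumes standard monoisotopic masses for the most abundant isotopes and assumes the positive-mode ion observed is the protonated molecule [M+H]⁺; the function and constant names are hypothetical.

```python
import re

# Monoisotopic masses (u) of the most abundant isotopes.
MONOISOTOPIC = {
    "C": 12.000000, "H": 1.007825, "N": 14.003074,
    "O": 15.994915, "I": 126.904473,
}
PROTON_MASS = 1.007276  # mass added on protonation, in u

def monoisotopic_mass(formula: str) -> float:
    """Monoisotopic mass of a simple formula such as 'C14H17IN4O3'."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += MONOISOTOPIC[element] * (int(count) if count else 1)
    return total

neutral_53 = monoisotopic_mass("C14H17IN4O3")
mh_plus_53 = neutral_53 + PROTON_MASS  # assumed [M+H]+ ion

print(f"calcd M = {neutral_53:.2f}, expected [M+H]+ = {mh_plus_53:.0f}")
```

With these values the neutral mass comes out at 416.03 and the protonated ion at a nominal m/z of 417, in line with the data reported for (53).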

3′-O-Allyl-7-deaza-7-[3-(2,2,2-trifluoroacetamide)]-2′-deoxyadenosine (54)

To a solution of (53) (494 mg, 1.19 mmol) in dry DMF (17 ml) was added sequentially copper (I) iodide (45.1 mg, 0.24 mmol), 2,2,2-trifluoro-N-prop-2-ynylacetamide (538 mg, 3.56 mmol), Et₃N (240 mg, 2.38 mmol) and Pd(Ph₃P)₄ (137 mg, 0.12 mmol) at room temperature. The flask was wrapped in foil to exclude light and stirred under N₂ for 18 h. Then MeOH (10 ml) and a small spatula of dowex bicarbonate H⁺ form were added and the mixture stirred for 30 min. The mixture was filtered, evaporated under reduced pressure and the residue triturated with MeOH to remove palladium salts. The filtrate was evaporated under reduced pressure and purified by flash chromatography on silica (DCM to DCM:MeOH 97:3). The desired product (54) was obtained as a brown solid (490 mg, 94%). ¹H NMR (d₆DMSO) δ 2.25-2.31 (m, 1H), 2.98-3.04 (m, 1H), 3.41-3.49 (m, 2H), 3.88-3.95 (m, 3H), 4.10-4.12 (d, 1H, J=5.2 Hz), 4.22-4.23 (d, 2H, J=5.3 Hz), 5.07-5.12 (m, 2H), 5.19-5.24 (dd, 1H, J=17.3, 1.9 Hz), 5.79-5.89 (m, 1H), 6.31-6.35 (dd, 1H, J=8.6, 5.6 Hz), 7.69 (s, 1H), 8.02 (s, 1H). Mass (−ve electrospray) calcd for C₁₉H₂₀F₃N₅O₄ 439.15, found 438.

3′-O-Allyl-7-[3-aminoprop-1-ynyl]-7-deaza-2′-deoxyadenosine 5′-O-nucleoside triphosphate (55)

The nucleoside (54) and proton sponge were dried over P₂O₅ under vacuum overnight. A solution of (54) (84 mg, 0.191 mmol) and proton sponge (49 mg, 0.382 mmol) in trimethylphosphate (600 μl) was stirred with 4 Å molecular sieves for 1 h. Freshly distilled POCl₃ (36 μl, 0.388 mmol) was added and the solution was stirred at 4° C. for 2 h. The mixture was slowly warmed to room temperature and bis (tri-n-butyl ammonium) pyrophosphate 0.5 M in solution in DMF (1.52 ml, 0.764 mmol) and anhydrous tri-n-butyl amine (364 μl, 1.52 mmol) were added. After 5 min, the reaction was quenched with 0.1 M TEAB (triethylammonium bicarbonate) buffer (5 ml) and stirred for 3 h. The water was removed under reduced pressure and the resulting residue was dissolved in concentrated ammonia (ρ0.88, 5 ml) and stirred at room temperature for 16 h. The reaction mixture was then evaporated to dryness. The residue was dissolved in water and the solution applied to a DEAE-Sephadex A-25 column. MPLC was performed with a linear gradient of 0.05 M to 1 M TEAB. Fractions containing the product were combined and evaporated to dryness. The residue was dissolved in water and further purified by HPLC. HPLC: t_(r)(55)=22.60 min (Zorbax C18 preparative column, gradient: 5% to 35% B in 20 min, buffer A 0.1 M TEAB, buffer B MeCN). The product was isolated as a white foam (17.5 μmol, 5.9%, ε₂₈₀=15000). ¹H NMR (D₂O) δ 2.67-2.84 (2m, 2H, H−2′), 4.14 (br s, 2H, CH₂NH), 4.17-4.36 (m, 2H, H−5′), 4.52 (br s, 1H, H−4′), 6.73 (t, J=6.6 Hz, 1H, H−1′), 8.06 (s, 1H, H−8), 8.19 (s, 1H, H−2). ³¹P NMR (D₂O) δ −5.07 (d, J=21.8 Hz, 1P, P_(γ)), −10.19 (d, J=19.8 Hz, 1P, P_(α)), −21.32 (t, J=19.8 Hz, 1P, P_(β)). Mass (−ve electrospray) calcd for C₁₅H₂₁N₈O₁₂P₃ 598.05, found 596.

To the Cy3 disulphide linker (2.6 μmol) in solution in DMF (450 μl) was added at 0° C. 100 μl of a mixture of 1-(3-dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride, 1-hydroxybenzotriazole hydrate and N-methylmorpholine (26 μM each) in DMF. The reaction mixture was stirred at room temperature for 1 h. The reaction was monitored by TLC (MeOH:CH₂Cl₂ 4:6) until all the dye linker was consumed. Then 400 μl of DMF were added at 0° C., followed by the nucleotide (55) (3.9 μmol) in solution in water (100 μl), and the reaction mixture was stirred at room temperature overnight. TLC (MeOH:CH₂Cl₂ 4:6) showed complete consumption of the activated ester and a dark red spot appeared on the baseline. The reaction was quenched with TEAB buffer (0.1M, 10 ml) and loaded on a DEAE Sephadex column (2×5 cm). The column was first eluted with 0.1 M TEAB buffer (100 ml) to wash off organic residues and then 1 M TEAB buffer (100 ml). The desired triphosphate (56) was eluted out with 1 M TEAB buffer. The fractions containing the product were combined, evaporated and purified by HPLC. HPLC conditions: t_(r)(56)=21.38 min (Zorbax C18 preparative column, gradient: 5% to 15% B in 1 min, then 4 min at 15% B, then 15 to 35% B in 15 min, buffer A 0.1M TEAB, buffer B MeCN). The product was isolated as a dark pink solid (0.15 μmol, 12.5%, ε₅₅₀=15000). ¹H NMR (D₂O) δ 2.03 (t, J=6.4 Hz, 2H, CH₂), 2.21-2.33 (m, 1H, H−2′), 2.37-2.49 (m, 1H, H−2′), 2.50 (t, J=6.3 Hz, 2H, CH₂), 2.66 (t, J=5.4 Hz, 2H, CH₂), 3.79 (t, J=6.4 Hz, 2H, CH₂), 3.99 (m, 4H, CH₂N, H−5′), 4.18 (br s, 1H, H−4′), 6.02, 6.17 (2d, J=13.6 Hz, 2H, H_(ar)), 6.30 (dd, J=6.1, 8.6 Hz, H−1′), 7.08, 7.22 (2d, J=7.8, 8.6 Hz, 2H, 2×═CH), 7.58-7.82 (m, 6H, 2H_(Ar), H−2, H−8), 8.29 (t, J=13.6 Hz, ═CH). ³¹P NMR (D₂O) δ −4.83 (m, 1P, P_(γ)), −10.06 (m, 1P, P_(α)), −20.72 (m, 1P, P_(β)).

Cleavage of 3′-Allyl Group in Aqueous Conditions

The following shows a typical deblocking procedure for a 3′-blocked nucleoside in which approximately 0.5 equivalents of Na₂PdCl₄ and 4 equivalents of the water-soluble phosphine ligand L were employed, in water, at 50° C. Tfa stands for trifluoroacetyl:

To a solution of Ligand L (7.8 mg, 13.7 μmol) in degassed H₂O (225 μl) was added a solution of Na₂PdCl₄ (0.5 mg, 1.6 μmol) in degassed H₂O (25 μl) in an Eppendorf vial. The two solutions were mixed well and after 5 min a solution of B (1 mg, 2.3 μmol) in H₂O (250 μl) was added. The reaction mixture was then placed in a heating block at 50° C. The reaction could be followed by HPLC. Aliquots of 50 μl were taken from the reaction mixture and filtered through an Eppendorf filter vial (porosity 0.2 μm); 22 μl of the solution were injected into the HPLC to monitor the reaction. The reaction mixture was purified by HPLC. In a typical experiment the cleavage was finished (i.e. >98% cleavage had occurred) after 30 min.

3′-OH Protected with a 3,4 dimethoxybenzyloxymethyl Group as a Protected Form of a Hemiacetal

Nucleotides bearing this blocking group have similar properties to the allyl example, though incorporate less rapidly. Deblocking can be achieved efficiently by the use of aqueous buffered cerium ammonium nitrate or DDQ, both conditions initially liberating the hemiacetal (1) which decomposes to the required (2) prior to further extension:

The 3′-OH may also be protected with benzyl groups where the phenyl group is unsubstituted, e.g. with benzyloxymethyl, as well as benzyl groups where the phenyl group bears electron-donating substituents; an example of such an electron-rich benzylic protecting group is 3,4-dimethoxybenzyloxymethyl.

In contrast, electron-poor benzylic protecting groups, such as those in which the phenyl ring is substituted with one or more nitro groups, are less preferred since the conditions required to form the intermediate groups of formulae —C(R′)₂—OH, —C(R′)₂—NH₂, and —C(R′)₂—SH are sufficiently harsh that the integrity of the polynucleotide can be affected by the conditions needed to deprotect such electron-poor benzylic protecting groups.

3′-OH Protected with a Fluoromethyloxymethyl Group as a Protected Form of a Hemiacetal

—O—CH₂—F

Nucleotides bearing this blocking group may be converted to the intermediate hemiacetal using catalytic reactions known to those skilled in the art such as, for example, those using heavy metal ions such as silver.

1. A modified nucleotide or nucleoside molecule comprising a purine or pyrimidine base and a ribose or deoxyribose sugar moiety having a removable 3′-OH blocking group covalently attached thereto, such that the 3′ carbon atom has attached a group of the structure —O-Z wherein Z is any of —C(R^(IV))₂—O—R″, —C(R′)₂—N(R″)₂, —C(R′)₂—N(H)R″, —C(R^(IV))₂—S—R″ and —C(R″)₂—F, wherein —C(R^(IV))₂—O—R″ is of the formula —CR⁴(R⁵)—O—CR⁴(R⁵)—OR⁶ or of the formula —CR⁴(R⁵)—O—CR⁴(R⁵)—SR⁶; and wherein —C(R^(IV))₂—S—R″ is of the formula —CR⁴(R⁵)—S—CR⁴(R⁵)—OR⁶ or of the formula —CR⁴(R⁵)—S—CR⁴(R⁵)—SR⁶; wherein each R″ is or is part of a removable protecting group; each R′ is independently a hydrogen atom, an alkyl, substituted alkyl, arylalkyl, alkenyl, alkynyl, aryl, heteroaryl, heterocyclic, acyl, cyano, alkoxy, aryloxy, heteroaryloxy or amido group, or a detectable label attached through a linking group; or (R′)₂ represents an alkylidene group of formula ═C(R′″)₂ wherein each R′″ may be the same or different and is selected from the group comprising hydrogen and halogen atoms and alkyl groups; each R⁴ and R⁵ is independently a hydrogen atom or an alkyl group; R⁶ is alkyl, cycloalkyl, alkenyl, cycloalkenyl or benzyl; and wherein said molecule may be reacted to yield an intermediate in which each R″ is exchanged for H or, where Z is —C(R′)₂—F, the F is exchanged for OH, SH or NH₂, preferably OH, which intermediate dissociates under aqueous conditions to afford a molecule with a free 3′OH; with the proviso that where Z is —C(R^(IV))₂—S—R″, both R^(IV) groups are not H.
 2. A molecule according to claim 1 wherein R′ is an alkyl or substituted alkyl.
 3. A molecule according to claim 1 wherein -Z is of formula —C(R′)₂—N₃.
 4. A molecule according to claim 1 wherein Z is an azidomethyl group.
 5. A molecule according to claim 1 wherein R″ is a benzyl or substituted benzyl group.
 6. A molecule according to claim 1 wherein said base is linked to a detectable label via a cleavable linker or a non-cleavable linker.
 7. A molecule according to claim 6 wherein said linker is cleavable.
 8. A molecule according to claim 1 wherein a detectable label is linked to the molecule through the blocking group by a cleavable or non-cleavable linker.
 9. A molecule according to claim 6 wherein said detectable label is a fluorophore.
 10. A molecule according to claim 6 wherein said linker is acid labile, photolabile or contains a disulfide linkage.
 11. A modified nucleotide molecule as claimed in claim 1 which comprises one or more ³²P atoms in its phosphate portion.
 12. A nucleoside, nucleotide or polynucleotide molecule of formula PN—O-allyl, wherein PN is said nucleoside or nucleotide or is a 3′terminal nucleotide of said polynucleotide; and said nucleoside or nucleotide further comprises in addition to the allyl blocking group a detectable label linked to the base thereof by a cleavable or non-cleavable linker.
 13. A molecule according to claim 12 wherein said linker is cleavable.
 14. A molecule according to claim 12 wherein said detectable label is a fluorophore.
 15. A molecule according to claim 12 wherein said linker is acid labile, photolabile or contains a disulfide linkage.
 16. A method of converting a compound of formula R—O-allyl, R₂N(allyl), RNH(allyl), RN(allyl)₂ or R—S-allyl to a corresponding compound in which the allyl group is removed and replaced by hydrogen, said method comprising the steps of reacting a compound of formula R—O-allyl, R₂N(allyl), RNH(allyl), RN(allyl)₂ or R—S-allyl in aqueous solution with a transition metal comprising a transition metal and one or more ligands selected from the group comprising water-soluble phosphine and water-soluble nitrogen-containing phosphine ligands, wherein the or each R is a water-soluble biological molecule.
 17. The method of claim 16 wherein said compound is of formula R—O-allyl.
 18. The method of claim 16 wherein said R is part of a nucleoside, a nucleotide or a polynucleotide molecule.
 19. The method of claim 18 wherein said nucleoside, nucleotide or polynucleotide further comprises a detectable label linked to the base thereof by a cleavable or non-cleavable linker.
 20. A molecule according to claim 19 wherein said linker is cleavable.
 21. The method of claim 19, wherein said detectable label is a fluorophore.
 22. The method of claim 19 wherein said linker is acid labile, photolabile or contains a disulfide linkage.
 23. The method of claim 19 wherein said allyl group and said label are removed in a single step.
 24. The method of claim 16 wherein said transition metal is selected from the group comprising platinum, palladium, rhodium, ruthenium, osmium and iridium.
 25. The method of claim 16 wherein said transition metal is palladium.
 26. The method of claim 16 wherein said group of ligands comprise derivatised triaryl phosphine ligands or derivatised trialkyl phosphine ligands.
 27. The method of claim 16 wherein said group of ligands are derivatised with one or more functionalities selected from the group comprising amino, hydroxyl, carboxyl and sulfonate groups.
 28. The method of claim 16 wherein the group of ligands comprises 3,3′,3″-phosphinidynetris(benzenesulfonic acid) and tris(2-carboxyethyl)phosphines and their salts.
 29. A method of controlling the incorporation of a nucleotide as defined in claim 6 and complementary to a second nucleotide in a target single-stranded polynucleotide in a synthesis or sequencing reaction comprising incorporating into the growing complementary polynucleotide said nucleotide, the incorporation of said nucleotide preventing or blocking introduction of subsequent nucleoside or nucleotide molecules into said growing complementary polynucleotide.
 30. The method of claim 29, wherein the incorporation of said nucleotide is accomplished by a terminal transferase or polymerase or a reverse transcriptase.
 31. The method of claim 30 wherein the polymerase is a Thermococcus sp.
 32. The method of claim 31 wherein the Thermococcus sp is 9° N or a single mutant or double mutant thereof.
 33. The method of claim 32 wherein the double mutant is −Y409V A485L.
 34. A method for determining the sequence of a target single-stranded polynucleotide, comprising monitoring the sequential incorporation of complementary nucleotides, wherein at least one incorporation is of a nucleotide as defined in claim 6 and wherein the identity of the nucleotide incorporated is determined by detecting the label linked to the base, and the blocking group and said label are removed prior to introduction of the next complementary nucleotide.
 35. The method of claim 34 wherein the label of the nucleotide and the blocking group are removed in a single chemical treatment step.
 36. A method for determining the sequence of a target single-stranded polynucleotide, comprising: (a) providing a plurality of different nucleotides wherein said plurality of different nucleotides are as defined in claim 6 and wherein the detectable label linked to each type of nucleotide can be distinguished upon detection from the detectable label used for other types of nucleotides; (b) incorporating the nucleotide into the complement of the target single-stranded polynucleotide; (c) detecting the label of the nucleotide of (b), thereby determining the type of nucleotide incorporated; (d) removing the label of the nucleotide of (b) and the blocking group; and (e) optionally repeating steps (b)-(d) one or more times; thereby determining the sequence of a target single-stranded polynucleotide.
 37. The method of claim 36 wherein said incorporating step is accomplished by a Thermococcus sp.
 38. The method of claim 37 wherein the Thermococcus sp is 9° N or a single mutant or double mutant thereof.
 39. The method of claim 38 wherein the double mutant is −Y409V A485L.
 40. The method of claim 36 wherein the label of the nucleotide and the blocking group are removed in a single chemical treatment step.
 41. A method according to claim 36, wherein each of the nucleotides are brought into contact with the target sequentially, with removal of non-incorporated nucleotides prior to addition of the next nucleotide, and wherein detection and removal of the label and the blocking group is carried out either after addition of each nucleotide, or after addition of all four nucleotides.
 42. The method according to claim 36, wherein each of the nucleotides are brought into contact with the target together simultaneously, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and the blocking group.
 43. The method according to claim 36, comprising a first step and a second step, wherein in the first step, a first composition comprising two of the four nucleotides is brought into contact with the target and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label, and wherein in the second step, a second composition comprising the two nucleotides not included in the first composition is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group, and wherein the first and second steps are optionally repeated one or more times.
 44. The method according to claim 36, comprising a first step and a second step, wherein in the first step, a composition comprising one of the four nucleotides is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group and wherein in the second step, a second composition comprising the three nucleotides not included in the first composition is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group and wherein the first steps and the second step are optionally repeated one or more times.
 45. The method according to claim 36, comprising a first step and a second step, wherein in the first step, a first composition comprising three of the four nucleotides is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group, and wherein in the second step, a composition comprising the nucleotide not included in the first composition is brought into contact with the target, and non-incorporated nucleotides are removed prior to detection and subsequent to removal of the label and blocking group, and wherein the first step and the second step are optionally repeated one or more times.
 46. A kit, comprising: (a) a plurality of different nucleotides wherein said plurality of different nucleotides are as defined in claim 6; and (b) packaging materials therefor.
 47. A kit according to claim 46, wherein the detectable label in each nucleotide can be distinguished upon detection from the detectable label used for any of the other three types of nucleotide.
 48. The kit of claim 46, further comprising an enzyme and buffers appropriate for the action of the enzyme.
 49. (canceled)
 50. A method of using a nucleotide of claim 1 wherein said method includes a Sanger or Sanger-type sequencing method.
 51. A method of controlling the incorporation of a nucleotide as defined in claim 12 and complementary to a second nucleotide in a target single-stranded polynucleotide in a synthesis or sequencing reaction comprising incorporating into the growing complementary polynucleotide said nucleotide, the incorporation of said nucleotide preventing or blocking introduction of subsequent nucleoside or nucleotide molecules into said growing complementary polynucleotide.
 52. A method for determining the sequence of a target single-stranded polynucleotide, comprising monitoring the sequential incorporation of complementary nucleotides, wherein at least one incorporation is of a nucleotide as defined in claim 12 and wherein the identity of the nucleotide incorporated is determined by detecting the label linked to the base, and the blocking group and said label are removed prior to introduction of the next complementary nucleotide.
 53. A method for determining the sequence of a target single-stranded polynucleotide, comprising: (a) providing a plurality of different nucleotides wherein said plurality of different nucleotides are as defined in claim 12 and wherein the detectable label linked to each type of nucleotide can be distinguished upon detection from the detectable label used for other types of nucleotides; (b) incorporating the nucleotide into the complement of the target single-stranded polynucleotide; (c) detecting the label of the nucleotide of (b), thereby determining the type of nucleotide incorporated; (d) removing the label of the nucleotide of (b) and the blocking group; and (e) optionally repeating steps (b)-(d) one or more times; thereby determining the sequence of a target single-stranded polynucleotide.
 54. A kit, comprising: (a) a plurality of different nucleotides wherein said plurality of different nucleotides are as defined in claim 12; and (b) packaging materials therefor.
 55. (canceled) 
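
By way of illustration only, and forming no part of the claims, the cycle recited in steps (b) to (e) of claim 53 can be sketched as a toy simulation. The sketch assumes one distinguishable label per nucleotide type and a template string written in the order in which it is read; all names (`LABEL_FOR_BASE`, `GrowingStrand`, `sequence_by_synthesis`, the `label_N` strings) are hypothetical, and no chemistry or enzymology is modelled.

```python
# Toy model of the incorporate / detect / deblock cycle. All names are
# hypothetical illustrations; nothing here models real chemistry.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

# Assumption: each nucleotide type carries its own distinguishable label.
LABEL_FOR_BASE = {"A": "label_1", "C": "label_2", "G": "label_3", "T": "label_4"}
BASE_FOR_LABEL = {label: base for base, label in LABEL_FOR_BASE.items()}


class GrowingStrand:
    """Complementary strand whose 3' end is either blocked or free."""

    def __init__(self):
        self.bases = []
        self.blocked = False   # True while the 3'-OH blocking group is present
        self.label = None      # label currently attached to the last base

    def incorporate(self, base):
        # A blocked 3' end prevents any further incorporation.
        if self.blocked:
            raise RuntimeError("3' end is blocked; deblock before extending")
        self.bases.append(base)
        self.blocked = True
        self.label = LABEL_FOR_BASE[base]

    def detect(self):
        # Identify the incorporated nucleotide from its label.
        return BASE_FOR_LABEL[self.label]

    def remove_label_and_block(self):
        # Restore a free 3' end so the next cycle can proceed.
        self.label = None
        self.blocked = False


def sequence_by_synthesis(template):
    """Return (called complementary bases, inferred template) as strings."""
    strand = GrowingStrand()
    called = []
    for template_base in template:
        strand.incorporate(COMPLEMENT[template_base])  # step (b)
        called.append(strand.detect())                 # step (c)
        strand.remove_label_and_block()                # step (d)
    inferred = "".join(COMPLEMENT[b] for b in called)  # step (e) repeats the loop
    return "".join(called), inferred
```

In this sketch, calling `incorporate` twice without an intervening `remove_label_and_block` raises an error, which mirrors the requirement that the blocking group prevents introduction of a subsequent nucleotide until it is removed.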